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ABSTRACT 


Eighteen  trained  panelists  evaluated  the  eating  quality 
of  semimembranosus  (SM)  steaks  using  an  eight-point  category 
scale,  a  15  cm  line  scale  with  anchors  1.3  cm  from  each  end 

and  modulus  free  magnitude  estimation  in  different 

\ 

sequences.  Comparison  of  the  evaluation  techniques  focused 
on  the  descriptive  assessment  of  a  total  of  54  delay-chilled 
(DC)  and  hot-boned  (HB)  cooked  SM  steaks. 

When  panelists  employed  category  scaling  (CS),  the  HB 
steaks  were  rated  lower  (P<0.05)  in  initial  and  overall 
tenderness  and  higher  ( P<0 .01)  in  overall  juiciness  than  DC 
steaks.  For  line  scaling  (LS),  there  were  no  significant 
differences  in  quality  attributes  due  to  postmortem 
treatment.  With  magnitude  estimation  (ME),  the  HB  steaks 
were  less  tender  (P<0.05)  and  more  juicy  (P<0.01)  than  DC 
steaks.  For  evaluations  by  each  technique,  connective 
tissue  amount  and  flavor  intensity  did  not  differ 
significantly  between  the  postmortem  treatments. 

Sensory  assessments  of  tenderness  and  juiciness  were 
generally  supported  by  objective  measurements.  Warner 
Bratzler  shear  values  tended  to  be  higher  for  HB  steaks  than 
for  DC  steaks.  In  addition,  the  HB  steaks  were  visually 
more  (P<0.01)  rare,  had  higher  ( P<0 .001)  Hunter  a  values  and 
greater  ( P<0 .01)  percentage  press  fluid  than  DC  steaks. 
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Treatment  F-values  showed  that  CS ,  with  significant 
F-values  for  initial  tenderness,  overall  tenderness  and 
overall  juiciness,  was  most  sensitive  to  treatment 
differences.  The  LS  technique,  with  no  significant 
F-values,  was  least  sensitive  in  detecting  differences  in 
steak  quality  attributes.  Magnitude  estimation,  with 
significant  F-values  for  overall  tenderness  and  overall 
juiciness,  was  as  sensitive  as  CS  to  most  treatment 
di f  f erences . 

Replication  F-values  for  each  of  the  three  evaluation 
techniques  were  similar  and  nonsignificant  for  most  quality 
attributes.  However,  for  CS,  LS  and  ME,  significant 
( P< 0.001 )  panelist  variation  was  noted  for  each  quality 
attribute.  Except  for  the  ME  assessment  of  overall 
juiciness,  F-values  for  the  treatment  by  panelist 
interaction  were  not  significant. 

Correlation  coefficients  between  each  of  the  evaluation 
procedures  for  initial  and  overall  tenderness  were  highly 
significant  ( P< 0.001 )  and  indicated  strong  linear 
relationships  between  scores  assigned  by  judges  using  CS ,  LS 
and  ME.  The  correlations  between  each  of  the  three 
techniques  for  connective  tissue  amount,  juiciness  and 
flavor  intensity,  while  sometimes  significant,  were  moderate 
to  low. 
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Correlations  of  tenderness  data  from  each  evaluation 
method  with  shear  values  yielded  similar  significant 
r-values  (r>0.63).  Coefficients  of  determination  for 
linear,  power,  logarithmic,  hyperbolic  and  parabolic 
functions  between  panel  tenderness  assessments  by  each 
technique  and  Warner  Bratzler  shear  were  similar.  For 
initial  and  overall  tenderness,  the  power  functions  between 
ME  and  shear  data  yielded  exponents  of  1.13  and  1.05, 
respectively . 

Panelists  found  that  the  CS  and  LS  methods  were  easier 
to  learn  and  required  less  effort  for  sample  evaluation  than 
ME.  In  addition,  CS  was  most  preferred  and  ME  was  least 
preferred  by  panelists. 

In  general,  findings  from  this  study  support  the  use  of 
category  scaling  for  the  descriptive  sensory  assessment  of 
beef.  To  optimize  the  potential  of  the  LS  and  ME  methods 
for  assessments  of  cooked  meat,  some  procedural 
modifications  may  be  needed. 
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INTRODUCTION 


Most  studies  concerned  with  factors  influencing  meat 
quality  (such  as  breeding,  management  or  processing 
treatments)  utilize  sensory  measurements  to  determine  the 
organoleptic  quality  of  cooked  meat  and  to  make  important 
decisions  regarding  all  aspects  of  beef  production, 
processing  and  marketing.  Therefore,  it  is  essential  that 
the  most  reliable  and  accurate  sensory  technique  be  used  to 
evaluate  cooked  meat  quality. 

The  American  Meat  Science  Association  (AMSA,  1978)  has 
suggested  that  category  scales  are  the  most  appropriate 
system  for  evaluating  the  sensory  attributes  of  meat. 

Semi st rue tured  line  scales,  which  form  the  basis  for 
"Quantitative  Descriptive  Analysis",  have  been  used  by 
several  meat  researchers  (Harries  et  al.,  1972;  Bouton  et 
al.,  1980a)  and  have  been  reported  to  be  more  sensitive  to 
product  differences  than  structured  scales  (Baten,  1946). 
However,  information  on  the  sensitivity  of  category  and  line 
scales  in  determining  differences  in  meat  quality  is 
limited . 

Magnitude  estimation,  a  ratio-scaling  technique,  has 
become  popular  for  food  research  (Moskowitz,  1983)  but  has 
not  been  widely  utilized  with  meat  (Larmond,  1976). 

Magnitude  estimation  can  provide  quantitative  information  on 
the  perception  of  meat  tenderness,  which  is  not  possible 
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with  either  category  or  line  scales.  Since  instrumental 
measurements  are  calibrated  in  ratios,  it  may  be 


advantageous  to  also  have  sensory  measures 
(Larmond,  1976).  Thus,  the  feasibility  of 
attribute  of  meat  such  as  tenderness  using 
estimation  requires  investigation. 
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Comparative  studies  exami 
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LITERATURE  REVIEW 


Category  Scaling 

Category  scaling  (CS)  is  routinely  employed  for  the 
descriptive  assessment  of  meat  ( AMSA ,  1978).  For  the  CS 
technique,  panelists  rate  the  intensity  of  a  given  attribute 
on  a  graduated  scale,  usually  seven  to  nine  categories  in 
length . 

Procedures  for  the  assessment  of  meat  by  CS  range  from 
an  elaborate  system  (Cover  et  al.,  1962a,  1962b,  1962c)  for 
quantifying  meat  quality  in  terms  of  softness,  juiciness, 
connective  tissue  and  three  muscle  fiber  attributes  to  an 
evaluation  of  the  single  component  of  tenderness  (Ray  et 
al.,  1982).  A  category  scale  scorecard,  typical  of  that  in 
current  use  by  meat  researchers,  includes  initial  and 
overall  tenderness,  initial  and  overall  juiciness, 
connective  tissue  amount  and  flavor  intensity  (AMSA,  1978). 

There  is  no  consensus  in  the  literature  as  to  the 
optimum  scale  length.  In  studies  of  meat  quality,  the 
length  of  category  scales  ranges  from  five  (McDowell  et  al., 
1982),  to  eight  (Crouse  et  al.,  1983)  to  21  points  (Saveli 
et  al .  ,  1981). 

Psychophysical  research  (Miller,  1956;  Norwich,  1981) 
has  suggested  that  the  optimum  number  of  categories  that  an 
individual  can  manipulate  simultaneously  is  in  the  order  of 
seven  plus  or  minus  two.  While  scales  five  to  nine 
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categories  in  length  have  shown  similar  reliability  (Bendig, 
1953,  1954),  scales  11  categories  in  length  are  less 
reliable  (Bendig,  1953).  For  an  analysis  of  handwriting 
legibility,  Garner  (1960)  reported  that  scale  sensitivity 
increased  as  the  number  of  categories  increased.  However, 
he  (Garner,  1960)  noted  that  the  optimum  number  of 
categories  would  be  a  function  of  the  amount  of 
discr iminability  inherent  in  the  particular  stimuli  under 
investigation . 

Information  on  optimal  scale  length  for  the  sensory 
evaluation  of  food  products  is  limited.  Hedonic  scales 
ranging  in  length  from  five  to  nine  categories  were  compared 
by  Jones  et  al.  (1955)  during  a  food  preference  survey  of 
5,400  soldiers.  Scales  eight  or  nine  categories  in  length 
tended  to  be  more  sensitive  to  differences  in  food 
preference  (as  measured  by  information  transmission)  than 
scales  five  to  seven  categories  in  length. 

Category  scales  for  hedonic  and  intensity  evaluations 
of  food  are  generally  anchored  with  descriptive  terms. 

Verbal  anchoring  has  been  shown  to  increase  the  reliability 
and  sensitivity  of  CS  (Bendig,  1953,  1954).  However,  the 
descriptive  terms  used  as  anchors  must  be  carefully 
selected.  Jones  and  Thurstone  (1955)  investigated  the 
semantics  of  51  descriptive  terms  used  for  anchoring  hedonic 
food  scales  and  identified  terms  with  a  similar  meaning  to  a 
wide  range  of  panelists.  In  a  subsequent  preference  survey 
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(Jones  et  al.,  1955),  20  food  items  were  rated  on  hedonic 
scales,  which  incorporated  suitable  descriptive  terms 
identified  by  Jones  and  Thurstone  (1355).  Discrimination  of 
food  preferences  was  improved  when  the  neutral  descriptor 
"neither  like  nor  dislike"  was  eliminated  (Jones  et  al., 
1955) . 

Raf f ensperger  et  al.  (1956)  developed  a  toughness- 
tenderness  scale  for  beef  by  following  techniques  similar  to 
those  used  by  Jones  and  Thurstone  (1955)  and  Jones  et  al. 
(1955)  for  hedonic  scales.  The  degree  of  ambiguity  for  each 
of  41  descriptive  terms  was  identified.  Then,  a  nine-point 
tender ness-toughness  scale  was  constructed  and  used  in  a 
pilot  study  to  evaluate  nine  beef  roasts.  The  centre 
category  labelled  "neither  tough  nor  tender"  was  avoided  by 
the  panelists.  Subsequently,  removal  of  this  neutral  phrase 
and  a  decrease  in  scale  length  to  eight  points  resulted  in  a 
substantial  reduction  in  systematic  error  along  the  scale 
(Raf f ensperger  et  al.,  1956). 

Results  of  a  recent  survey  by  Cross  (1977),  described 
in  the  American  Meat  Science  Association  Guidelines  for 
Cookery  and  Sensory  Evaluation  of  Meat  (AMSA,  1978),  and  an 
examination  of  the  current  literature  indicate  that  most 
meat  scientists  now  employ  eight-point  CS  (with  no  centre 
neutral  category)  for  the  descriptive  sensory  assessment  of 
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Many  parametric  and  nonparametr ic  statistical 
procedures  have  been  employed  for  CS  data.  The  use  of 
parametric  statistical  procedures  for  data  analysis  is 
considered  valid  only  when  certain  assumptions  are  met  or 
approximated.  For  analyses  of  variance,  these  assumptions 
include:  normality,  independence  and  homogeneity  of  variance 
(Ga i to ,  1980). 

Although  successive  intervals  along  the  category  scale 
may  not  be  equal  in  subjective  magnitude  (Stevens  and 
Galanter,  1957),  integer  values  from  category  scales  are 
generally  used  for  data  analysis  (Chapman  and  Wigfield, 
1970).  Cloninger  et  al.  (1976)  reported  that  there  was 
little  change  to  analyses  of  variance  with  the  normalization 
of  category  data  from  intensity  scales.  Data  from  nine 
sensory  studies  utilizing  scales  varying  from  five  to  15 
points  were  normalized  by  calculating  z-values  to  estimate 
the  position  of  the  categories  on  the  psychological 
continuum.  Analyses  of  transformed  data  from  a  five-point 
scale  resulted  in  an  F-value  of  12.73  rather  than  12.80  for 
non-t ransf ormed  data;  and  for  a  nine-point  scale  of  2.80 
versus  3.08. 

The  popularity  and  continued  use  of  CS  has  been 
attributed  to  its  diversity  and  apparent  simplicity  (Amerine 
et  al.,  1965).  However,  CS  may  have  some  shortcomings. 

Data  from  CS  may  not  be  reproducible  from  one  laboratory  to 
another.  Panelists  may  attempt  to  stretch  or  contract  the 
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scale  to  fit  the  number  of  categories  (Kapsalis  and 
Moskowitz,  1977;  Moskowitz,  1982).  Category  scale  data  also 
are  highly  dependent  upon  the  scale  length  (Kapsalis  and 
Moskow i tz ,  1977). 

Panelists  may  avoid  using  the  ends  of  the  scale  and 
this  "category  end  effect"  could  bias  the  ratings  (O'Mahony, 
1979).  Panelists  also  may  be  more  sensitive  to  sample 
differences  at  the  centre  of  a  scale  than  at  either  end 
(Cloninger  et  al.,  1976).  However,  Stevens  and  Galanter 
(1957)  noted  that  panelists  are  able  to  discriminate  better 
at  one  end  of  the  sensory  continuum  than  at  the  other. 

Finally,  descriptive  anchors  may  confuse  the  judges 
or  may  be  interpreted  differently  by  panelists  (Jones  and 
Thurstone,  1955).  However,  the  careful  selection  of 
descriptive  terms  generally  can  produce  scales  with  similar 
meaning  for  most  panelists  (Jones  et  al.,  1955; 

Raf f ensperger  et  al.,  1956). 

The  widespread  use  of  CS  for  the  descriptive  assessment 
of  meat  indicates  that  this  method  is  considered  to  be  both 
reliable  and  sensitive  to  differences  in  meat  quality. 
However,  few  systematic  studies  of  the  CS  technique  have 
been  conducted.  In  addition,  several  limitations  of  the  CS 
procedure  have  been  identified.  Since  it  is  important  to 
use  the  most  reliable  and  accurate  sensory  technique  for 
determining  meat  quality,  an  evaluation  of  the  CS  procedure 
and  alternative  methods  is  needed. 
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Line  Scaling 

Semi  structured  line  scaling  (LS),  which  forms  the  basis 
for  "Quantitative  Descriptive  Analysis"  (QDA)  (Stone  et  al., 
1974,  1980),  has  been  used  by  several  meat  researchers 
(Harries  et  al.,  1972;  Randall  and  Larmond,  1977;  Bouton  et 
al.,  1980a;  Stanley  et  al.,  1980).  Line  scales  are  usually 
15  cm  in  length  with  verbal  descriptive  anchors  1.3  cm  from 
each  end  (Stone  et  al.,  1974).  Panelists  place  a  vertical 
mark  across  the  horizontal  line  scale  at  the  location  which 
best  reflects  his/her  perception  of  the  intensity  of  the 
attribute  under  evaluation. 

Line-scaling  procedures  used  for  the  descriptive 
assessment  of  meat  vary  considerably.  Modifications  of  the 
QDA  procedure  have  been  used  for  textural  evaluations  of 
beef  (Harries  et  al.,  1972),  ground  beef  (Randall  and 
Larmond,  1977)  and  chicken  (Lyon  and  Klose,  1980).  In  QDA, 
trained  panelists  identify  and  quantify,  in  order  of 
occurrence,  the  quality  attributes  of  a  product.  Several 
researchers  (Walker  et  al.,  1977;  Bouton  et  al.,  1980a)  have 
used  unstructured  (line)  scales  for  tenderness  and  juiciness 
evaluations  of  beef  from  various  postmortem  processing 
treatments . 

Procedures  for  utilizing  LS  have  not  been 
systematically  evaluated.  Some  workers  (Baten,  1946; 

Randall  and  Larmond,  1977;  Stanley  et  al.,  1980)  have 
allowed  direct  comparisons  of  LS  ratings  between  samples. 
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For  each  attribute  under  investigation,  several  samples  were 
scored  either  on  the  same  line  scale  (Randall  and  Larmond, 

1977)  or  on  adjacent  line  scales  (Baten,  1946;  Stanley  et 
al.,  1980).  However,  most  researchers  (Harries  et  al., 

1972;  Stone  et  al.,  1974;  Lyon  et  al.,  1978;  Lyon  and  Klose, 
1980;  Vickers,  1983)  have  used  separate  scorecards  for  the 
evaluation  of  quality  attributes  related  to  each  sample. 

While  most  line  scales  are  15  cm  (6  in)  in  length 
(Baten,  1946;  Randall  and  Larmond,  1977;  Stone  et  al.,  1974, 
1980),  scales  10  cm  long  (Harries  et  al.,  1972;  Giovanni  and 
Pangborn,  1983)  to  18.4  cm  long  (Vickers,  1983)  have  been 
used.  In  addition,  descriptive  anchors  have  been  placed  1.3 
cm  from  each  end  of  the  scale  (Stone  et  al.,  1974,  1980; 

Lyon  and  Klose,  1980),  2.2  cm  from  each  end  (Vickers,  1983) 
or  at  each  end  of  the  scale  (Baten,  1946;  Giovanni  and 
Pangborn,  1983).  Occasionally,  the  line  scale  has  also  been 
anchored  at  its  midpoint  (Stone  et  al.,  1974;  Lyon  et  al., 

1978) .  However,  Stone  et  al.  (1980)  stated  that  their  QDA 
line  scales  were  improved  when  the  middle  anchor  point  was 
eliminated . 

For  data  analysis,  the  location  of  each  response  along 
the  line  scale  is  converted  to  a  numerical  score  which  can 
be  analysed  by  a  variety  of  statistical  procedures.  To 
facilitate  tabulation  of  LS  data  from  QDA,  Stone  et  al. 
(1980)  developed  a  specialized  system  which  incorporated  a 
digitizer  with  visual  display,  a  storage  system,  a 
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microprocessor,  a  printer  and  interface  with  a  main  frame 
computer.  Van  de  Voort  et  al.  (1981)  decribed  a  variation 
of  the  LS  technique  which  allowed  the  direct  transfer  of 
data  from  the  panelist  to  the  computer.  Panelists  scored 
attributes  on  Optical  Mark  Reader  (OMR)  cards  which 
consisted  of  39  "bubbles"  instead  of  the  continuous  line 
scale.  Marked  cards  were  then  read  directly  by  the 
computer.  Despite  the  potential  advantages  of  a  digitizer 
or  OMR  card  system,  many  researchers  continue  to  use  rulers 
or  templates  to  assign  numerical  values  to  LS  data  (Lyon  and 
Klose,  1980;  Stanley  et  al.,  1980;  Vickers,  1983). 


The  LS  procedure  has  some  of  the  same  limitations 
decribed  previously  for  the  CS  technique.  Panelists  may 
avoid  positions  near  the  boundaries  (the  category  end 
effect,  O'Mahony,  1979).  Although  the  line  scale  allows  a 
panelist  to  discriminate  as  finely  as  desired  along  the 
scale,  the  fixed  constraints  at  the  ends  of  the  scale  can 
limit  the  panelist's  freedom  and  generate  biased  data 
(Stevens,  1975).  In  addition,  panelists  tend  to  use  the 
upper  portion  of  the  line  scale  more  than  the  lower  portion 
(Moskowi tz ,  1  983 ) . 


However,  the  LS  technique  possesses  advantages  over  the 
traditional  category  scaling  procedure.  The  LS  procedure 
provides  a  continuous  rather  than  stepwise  scale  (Hall, 

1958)  and  generally  produces  equal  interval  data  (Anderson, 
1970;  Weiss,  1972).  In  addition,  the  line  scale  is  less 
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susceptible  to  numerical  preferences  (Hall,  1958;  Moskowitz, 
1983).  The  reduced  verbal  anchoring  characteristic  of  the 
line  scale  may  also  minimize  the  problem  of  unequal 
intervals  associated  with  the  interpretation  of  descriptive 
terms  (Jones  et  al.,  1955). 

Information  on  the  sensitivity  of  category  and  line 
scales  in  determining  differences  in  meat  quality  is 
limited.  Raf f ensperger  et  al.  (1956)  scored  the  tenderness 
of  beef  cuts  from  different  grades  using  a  nine-point 
category  scale  and  a  modified  line  scale  (nine  divisions 
were  marked)  and  obtained  similar  results  using  each  method. 
However,  for  assessments  of  apple  quality,  Baten  (1946) 
reported  that  the  statistical  analysis  of  data  generated 
from  LS  resulted  in  larger  t-values  than  those  for  data  from 
the  CS  procedure.  Thus,  comparative  studies  of  LS  and  CS 
procedures  are  needed  to  assess  the  relative  merits  of  these 
sensory  evaluation  procedures  for  the  descriptive  sensory 
assessment  of  meat. 

Magnitude  Estimation 

Magnitude  estimation  (ME),  a  ratio-scaling  technique, 
has  become  popular  for  food  research  (Moskowitz,  1983)  but 
has  not  been  widely  utilized  for  meat  (Larmond,  1976).  In 
using  ME,  panelists  assign  numbers  to  stimuli  in  proportion 
to  the  perceived  intensity  of  the  attribute.  The  ratios 
between  subjective  assessments  are  assumed  to  reflect  the 
underlying  ratios  of  perceived  magnitude.  For  example,  a 
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stimulus  judged  '20'  is  perceived  to  be  twice  as  intense  as 
one  judged  '10'.  For  the  first  sample  presented,  the 
experimenter  may  assign  a  certain  value  (fixed  modulus  ME) 
or  may  allow  each  panelist  to  assign  any  value  of  his/her 
choice  (free  modulus  ME). 

Magnitude  estimation  has  been  widely  used  for  intensity 
and  hedonic  assessments  of  taste  in  model  systems 
(Moskowitz,  1970;  Moskowitz  and  Dubose,  1977;  O'Mahony  and 
Heintz,  1981;  Williams  and  Bernhard,  1981;  Johnson  and 
Clydesdale,  1982)  and  in  simple  food  systems  (Cardello  et 
al.,  1979;  DuBose  et  al.,  1980;  Johnson  et  al.,  1982; 
Giovanni  and  Pangborn,  1983).  Textural  attributes  of  foods 
also  have  been  evaluated  using  the  ME  procedure  (Cardello  et 
al.,  1982a,  1982b).  However,  few  studies  have  employed  ME 

for  assessments  of  complex,  heterogeneous  food  systems  such 
as  meat. 

There  is  limited  information  on  procedures  utilizing 
the  ME  technique  for  the  sensory  evaluation  of  meat.  In  an 
early  study,  Segars  et  al.  (1975)  used  17  experienced  judges 
to  score  three  textural  attributes  of  six  beef  muscles  using 
free  modulus  ME.  In  the  ME  evaluation  of  meat  samples, 
panelists  were  instructed  to  assign  larger  numbers  to 
indicate  increased  difficulty  of  cutting  (one  or  two  chews), 
increased  effort  in  chewing  the  meat  and  increased  residue 
at  the  end  of  chewing.  Later,  Segars  et  al.  (1981)  used  two 
slightly  different  modulus  free  ME  procedures  for 
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evaluations  of  irradiated  ham  and  beef  rolls.  In  one  phase 
of  the  study,  panelists  evaluated  four  attributes  in  each 
sample  in  the  series  presented.  In  another  phase,  panelists 
evaluated  a  separate  series  of  samples  for  each  of  three 
attributes  (Segars  et  al.,  1981). 

Prior  to  statistical  analysis,  a  variety  of 
normalization  and  calibration  procedures  can  be  applied  to 
ME  data  to  reduce  panel  variability  due  to  panelists'  choice 
of  numbers.  In  "modulus  equalization"  (Lane  et  al.,  1961), 
the  ratings  from  each  panelists  are  multiplied  by  a 
correction  factor  such  that  each  panelist's  geometric  mean 
is  equal  to  the  group  geometric  mean.  Moskowitz  (1970) 
described  a  normalization  procedure  called  "modulus 
normalization"  in  which  ratings  from  each  panelist  for  a 
core  stimulus  (internal  standard)  or  for  a  series  of  stimuli 
can  be  used  to  adjust  ratings  across  panelists  and  across 
experiments.  In  the  "external  calibration"  procedure 
(Moskowitz,  1977),  panelists  assign  numbers  to  a  series  of 
word  descriptors  in  the  same  manner  that  they  had  used  for 
the  evaluation  of  samples.  A  correction  factor  is  then 
determined  for  each  panelist. 

In  a  collaborative  study  involving  20  laboratories 
(Powers  et  al.,  1981),  the  three  methods  for  normalizing  the 
magnitude  estimates  described  above  were  compared.  Powers 
et  al.  (1981)  concluded  that  the  three  procedures  led  to 
essentially  the  same  results.  Moskowitz  (1977,  1983)  has 
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outlined  examples  of  how  these  normalization  procedures  can 
be  implemented. 

Magnitude  estimation  data  can  be  analysed  by  any 
appropriate  statistical  procedure.  However,  magnitude 
estimates  are  distributed  log-normally  (Marks,  1974)  with 
the  majority  of  the  numbers  at  the  lower  end  of  the  number 
range . 

Logarithmic  transformations  are  frequently  performed  to 
change  the  distribution  of  the  data  from  a  log-normal  to  a 
normal  distribution,  a  requirement  of  parametric  statistical 
procedures  (Moskowitz  and  Sidel,  1971;  Powers  et  al.,  1981; 
Garc ia-Medina ,  1981;  Giovanni  and  Pangborn,  1983).  For  ME 
data,  the  geometric  mean  is  the  preferred  average  as  it 
preserves  the  ratio  between  the  estimates  and  is  not 
excessively  influenced  by  an  occasionally  large  estimate 
(Marks,  1974).  However,  logarithmic  transformations  of  ME 
data  may  not  always  be  appropriate.  When  panelists  assign 
zero  ratings,  logarithms  can  not  be  used  and  geometric  means 
can  not  be  calculated  (Moskowitz,  1977).  Moskowitz  (1983) 
reported  that  similar  conclusions  were  reached  from  analyses 
of  both  nontransf ormed  and  transformed  data. 

Several  biases  of  the  ME  technique  have  been 
identified.  Most  panelists  operate  within  a  limited  range 
of  familiar  numbers  and  show  aberrant  behavior  in  scaling 
very  large  or  small  stimuli  (Moskowitz,  1983).  Thus, 
several  researchers  (Howard,  1973;  Williams  and  Bernhard, 
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1981)  have  suggested  that  for  ME  experiments,  extremes  in 
sensory  magnitude  should  be  avoided. 

The  "regression  effect"  is  another  common  bias  of  ME 
data  (Moskowitz,  1983).  When  assigning  ratings  to  reflect 
intensity,  panelists  show  conservative  behavior  which  could 
lead  to  poorer,  less  discriminating  data.  An  additional 
limitation  of  ME  is  that  panelists  prefer  to  use  round 
numbers  more  than  odd  unusual  numbers.  O’Mahony  and  Heintz 
(1981)  used  fixed  modulus  ME  to  measure  the  perceived 
intensity  of  salt  solutions  and  found  that  more  than  75  %  of 
the  estimates  were  simple  multiples  of  the  standard  10  or 
unit  values  from  0  to  5.  A  partial  remedy  of  this  bias  has 
been  the  use  of  modulus  free  ME  (Moskowitz,  1983). 

It  is  unclear  whether  or  not  the  ME  technique  produces 
numbers  that  have  true  ratio  scale  properties  or  whether 
this  method  produces  nothing  more  than  a  disguised  internal 
type  measurement  (O'Mahony  and  Heintz,  1981;  Birnbaum, 

1982) .  However,  proponents  of  ME  (Stevens  and  Galanter, 
1957;  Marks,  1974;  Moskowitz,  1982,  1983)  have  described 
numerous  studies  confirming  the  validity  and  reliability  of 
this  and  other  ratio-scaling  procedures.  Concerns  about  the 
validity  of  the  ME  procedure  have  not  been  fully  resolved 
and  continue  to  be  expressed  in  the  psychological  literature 
(Weiss,  1981;  McBride,  1983;  Mellers,  1983). 

However,  ME  has  several  advantages  over  the  CS  and  LS 
procedures.  Because  the  judge  is  unconstrained  as  to  the 
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range  and  size  of  numbers  that  can  be  used,  the  magnitude 
scale  is  not  truncated  at  the  extremes  of  sensory  magnitude 
(Moskowitz  and  Sidel,  1971).  In  addition,  ME  data  can  be 
expressed  as  ratios  or  percentages  (Moskowitz,  1975).  Thus, 
magnitude  estimates  provide  quantitative  information  about 
how  panelists  perceive  different  stimuli  while  data  from  CS 
and  LS  only  convey  information  about  intervals  (Moskowitz, 

1  983)  . 

Workers  in  psychometrics  and  psychophysics  (Stevens  and 
Galanter,  1957;  Marks  and  Cain,  1972)  have  found  that  for 
many  perceptual  continua,  a  power  function  S=kln?  relates 
sensory  intensity,  S,  to  physical  intensity,  I.  In  log-log 
coordinates,  the  power  function  becomes  a  line 

log  S  =  n  log  I  +  log  k 

with  slope  'n'  and  intercept  /k/.  Much  attention  has  been 
directed  to  the  exponent  (slope  'n')  of  each  continuum 
because  the  exponent  is  an  index  of  perceptual  sensitivity 
and  governs  the  rate  at  which  perceived  intensity  increases 
with  physical  magnitude  (Moskowitz  et  al.,  1974). 

Since  instrumental  measurements  are  calibrated  in 
ratios,  it  may  be  advantageous  to  also  have  sensory  measures 
on  a  ratio  scale.  In  addition,  equations  relating  the 
textural  properties  of  meat  assessed  inst rumentally  to 
texture  judged  by  panelists  would  be  a  very  valuable  tool 
for  meat  scientists.  However,  few  studies  in  meat  research 
have  employed  ME.  Segars  et  al.  (1975)  reported  that 
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exponents  relating  textural  properties  of  meat  to  an 
instrumental  measure  of  tenderness  (punch  and  die  shear 
device)  ranged  from  1.8  to  2.6.  These  exponents  indicate 
that  the  panel  was  much  more  sensitive  than  the  machine  in 
detecting  textural  differences.  A  two-fold  increase  in  an 
instrumental  property  was  perceived  as  a  3.5-  to  six-fold 
increase  in  the  sensory  attribute.  Because  ME  values  could 
be  assigned  without  contraction  of  the  scale  at  the  extremes 
of  the  texture  range,  sensory  data  paralleled  the 
instrumental  measurements  (Segars  et  al.,  1975).  Thus,  the 
potential  of  ME  for  the  descriptive  sensory  assessment  of 
meat  requires  further  investigation. 

In  addition,  few  studies  have  compared  ME  to  either  CS 
or  LS  for  the  descriptive  assessment  of  food  attributes. 
McDaniel  and  Sawyer  (1981a)  used  nine-point  CS  and  modulus 
free  ME  to  score  the  intensity  of  19  descriptive  profile 
terms  of  whiskey  sour  formulations.  The  ME  technique 
yielded  a  similar  number  of  significant  differences  to  CS 
(McDaniel  and  Sawyer,  1981a).  However,  Giovanni  and 
Pangborn  (1983)  obtained  larger  treatment  F-values  for 
panelists'  evaluations  of  intensities  of  fat  in  milk  and 
sucrose  in  lemonade  by  the  LS  technique  than  by  ME.  Further 
comparative  studies  of  CS ,  LS  and  ME  procedures  are  needed. 

Comparative  Sensory  Methodology 

Many  early  studies  on  taste  panel  methodology  have 
focused  on  sensory  tests  such  as  the  paired  comparison, 
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ranking,  rating  and  scoring  (Baten,  1946;  Murphy  et  al., 
1954,  1957;  Pilgrim  and  Wood,  1955;  Raf f ensperger  et  al., 
1956;  Filipello,  1957;  Gridgeman,  1961).  In  recent  years, 
there  has  been  a  renewed  interest  in  the  comparison  of 
different  sensory  evaluation  techniques  used  for  intensity 
and  hedonic  assessments  of  food  products  (Moskowitz  and 
Sidel,  1971;  McDaniel  and  Sawyer,  1981a,  1981b;  Moskowitz, 
1982;  Giovanni  and  Pangborn,  1983;  Vickers,  1983). 

Although  the  sensory  evaluation  techniques  examined  by 
researchers  differ  considerably,  in  general,  comparative 
studies  of  sensory  techniques  have  used  similar  procedures 
and  evaluation  criteria.  In  most  studies,  treatments  were 
selected  to  provide  differences  in  the  attribute/product 
under  investigation.  For  example,  Murphy  et  al.  (1957)  used 
three  different  strawberry  varieties,  Gridgeman  (1961) 
stored  eggs  under  four  conditions  and  McDaniel  and  Sawyer 
(1981a,  1981b)  prepared  a  number  of  different  whiskey  sour 
formulations . 

In  several  studies  (Pilgrim  and  Wood,  1955;  Gridgeman, 
1961;  Moskowitz  and  Sidel,  1971),  different  groups  of 
panelists  were  used  for  evaluations  by  each  sensory 
technique.  However,  most  researchers  (Baten,  1946;  Murphy 
et  al.,  1954,  1957;  Carlin  et  al.,  1956;  Filipello,  1957; 
McDaniel  and  Sawyer,  1981a,  1981b;  Giovanni  and  Pangborn, 
1983;  Vickers,  1983)  employed  one  group  of  panelists  to 
compare  sensory  procedures.  Gridgeman  (1961)  stated  that  an 
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ideal  experimental  comparison  would  involve  the  same  group 
of  panelists  for  each  of  the  sensory  techniques  under 
investigation . 

In  an  early  comparative  study  (Murphy  et  al.,  1957), 
panelists  used  three  sensory  evaluation  techniques  on  the 
same  day  (and  possibly  at  the  same  session).  However,  to 
minimize  possible  memory  influences,  training  and 
motivational  factors,  various  procedures  have  been  employed. 
Carlin  et  al.  (1956)  presented  two  scoring  techniques  on 
alternate  days.  McDaniel  and  Sawyer  (1981a,  1981b)  employed 
CS  and  ME  techniques  on  different  days  according  to 
randomized  designs.  Recently,  balanced  designs  were  used 
for  a  comparison  of  LS  and  ME  techniques  (Giovanni  and 
Pangborn,  1983;  Vickers,  1983).  In  both  of  these  studies, 
half  of  the  panelists  employed  LS  first,  while  the  remaining 
panelists  began  with  ME. 

Statistical  procedures  used  to  quantitatively  compare 
sensory  evaluation  methods  which  have  different  scales  of 
measurement  are  lacking  (Murphy  et  al.,  1957).  However, 
criteria  which  have  been  used  to  compare  sensory  evaluation 
techniques  include: 

(1)  the  number  of  significant  differences  among  the 
treatments  (Murphy  et  al.,  1954,  1957;  Pilgrim  and  Wood, 
1955;  Moskowitz  and  Sidel,  1971;  Moskowitz,  1982;  McDaniel 
and  Sawyer,  1981a,  1981b) 

(2)  the  magnitude  and  significance  of  the  F-value  or  t-value 
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from  statistical  analyses  of  each  evaluation  technique 
(Baten,  1946;  Murphy  et  al.,  1954;  Gridgeman,  1961; 

Dransf ield  et  al.,  1982;  Moskowitz,  1982;  Giovanni  and 
Pangborn,  1983) 

(3)  correlation  coefficients  (Filipello,  1957;  Dransfield  et 
al.,  1982;  Giovanni  and  Pangborn,  1983),  and 

(4)  the  sensitivity  of  the  judges  (Carlin  et  al.,  1956; 
Murphy  et  al.,  1957;  Giovanni  and  Pangborn,  1983).  In 
addition,  several  researchers  (Baten,  1946;  Murphy  et  al., 
1957)  have  examined  panelist  preference  among  the  sensory 
evaluation  techniques  employed. 

Some  criteria  for  examining  sensory  methods  have  been 
specific  to  comparative  studies  of  CS,  LS  and  ME  procedures. 
For  CS  and  ME  data,  relationships  between  hedonic  or 
intensity  evaluations  and  physical  concentration  have  been 
plotted  (Moskowitz  and  Sidel,  1971)  or  expressed 
mathematically  (Moskowitz,  1982).  In  addition,  Giovanni  and 
Pangborn  (1983)  have  tested  the  "goodness  of  fit"  for 
mathematical  models  between  LS  and  ME  data  and  between 
sensory  data  and  physical  concentrations  by  examining  the 
resulting  coefficients  of  determination.  Anderson  (1977) 
and  Coleman  et  al.  (1981)  have  suggested  that  the  "goodness 
of  fit"  of  a  mathematical  function  should  be  tested  by 
measuring  the  significance  of  the  deviations  from  the  model 
by  analysis  of  variance.  However,  several  workers  (O’Mahony 
and  Heintz,  1981;  Powers  et  al.,  1981)  have  used  correlation 
coefficients  to  test  mathematical  models  for  ME  data. 
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Comparative  studies  of  CS ,  LS  and  ME  procedures  have 
focused  on  either  hedonic  assessments  of  food  products 
(Moskowitz  and  Sidel,  1971;  McDaniel  and  Sawyer,  1981b; 
Moskowitz,  1982;  Vickers,  1983),  or  on  intensity  evaluations 
of  food  attributes  (McDaniel  and  Sawyer,  1981a),  or  both 
(Giovanni  and  Pangborn,  1983).  However,  few  studies  using 
trained  panelists  have  compared  CS,  LS  and  ME  for  the 
descriptive  assessment  of  food  attributes.  McDaniel  and 
Sawyer  (1981a)  used  a  trained  panel  to  score  the  intensity 
of  19  descriptive  profile  terms  of  whiskey  sour  formulations 
using  CS  and  ME.  In  addition,  studies  comparing  sensory 
procedures  for  the  descriptive  assessment  of  meat  are 
lacking. 

Raf f ensperger  et  al.  (1956)  used  a  nine-point  category 
scale  and  a  modified  line  scale  (nine  divisions  were  marked) 
to  score  the  tenderness  of  beef  cuts  from  different  grades. 
The  evaluation  techniques  were  compared  by  examining  the 
discr iminability  of  the  scales,  agreement  among  judges  and 
distributions  of  the  ratings.  Scale  sensitivity  and 
panelist  agreement  did  not  differ  between  scaling 
techniques;  however,  the  CS  procedure  had  a  significantly 
greater  amount  of  systematic  error  than  the  LS  procedure 
(Raf f ensperger  et  al.,  1956). 

The  eating  quality  of  beef  assessed  at  each  of  five 
European  research  institutes  was  compared  by  utilizing  a 
common  eight-point  category  scale  (Dransfield  et  al.,  1982). 
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Criteria  used  to  compare  data  from  each  of  the  research 
institutes  included:  correlation  of  sensory  data,  panel 
discrimination  of  tenderness  differences,  panel  variation 
between  institutes  and  an  examination  of  the  sensory  data. 
In  addition,  instrumental  measures  of  tenderness  (Warner 
Bratzler  shear  and  Volodkevich-style  jaws)  were  correlated 
with  panel  tenderness  scores.  Assessments  of  tenderness  by 
the  experienced  panels  were  highly  correlated  across 
institutes.  Instrumental  measures  of  tenderness  also 
correlated  highly  with  panel  tenderness  scores.  Panelists 
at  one  research  institute  did  not  discriminate  tenderness 
differences  as  well  as  panelists  at  the  other  four  research 
institutes  (Dransfield  et  al.,  1982). 

The  procedures  and  evaluation  criteria  identified  in 
the  preceding  discussion  may  be  useful  for  a  comparative 
study  of  CS,  LS  and  ME  for  the  descriptive  sensory 
assessment  of  beef. 

Training 

Meat  scientists  routinely  employ  trained  descriptive 
panels  to  determine  the  effects  of  various  production  and 
processing  treatments  on  the  quality  attributes  of  cooked 
meat.  However,  recommendations  regarding  the  extent  of 
panelist  training  for  CS,  LS  and  ME  differ.  An  "expert" 
descriptive  panel  using  CS  may  receive  three  to  four  months 
of  intensive  training  ( AMSA ,  1978);  panelists  using  LS  for 

assessments  of  food  usually  receive  about  10  hours 
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of  training  (Zook  and  Wessman,  1977).  With  ME ,  it  is  felt 
that  a  conscientious  person  can  make  quantitative  estimates 
of  quality  attributes  in  foods,  after  a  few  introductory 
sessions  (Moskowitz,  1977;  Williams  and  Bernhard,  1981). 
However,  use  of  ME  by  trained  rather  than  untrained  panels 
may  result  in  improved  discrimination  of  textural  attributes 
in  foods  (Cardello  et  al.,  1982a).  In  addition,  practice  in 
the  use  of  ME  may  result  in  more  stable  judgements 
(Teghtsoonian  and  Teght soon ian ,  1971;  Giovanni  and  Pangborn, 

1983) . 

Cross  et  al.  (1978)  and  the  AMSA  (1978)  outlined 
selection  and  training  procedures  for  the  descriptive 
sensory  assessment  of  beef  which  have  now  been  adopted  by 
many  meat  researchers  (Kastner  et  al.,  1980;  Davis  et  al., 
1981;  Berry  and  Cross,  1982;  Bowles  Axe  et  al.,  1983;  Crouse 
et  al.,  1983).  Three  objectives  of  training  were 
identified:  (1)  to  familiarize  the  panelist  with  the  test 
procedures,  (2)  to  improve  a  panelist's  ability  to  recognize 
and  identify  sensory  attributes,  and  (3)  to  improve  the 
panelist's  sensitivity  and  memory  (Cross  et  al.,  1978;  AMSA, 
1978).  In  the  study  by  Cross  et  al.  (1978),  training  was 
accomplished  through  individual  and  group  sessions  in  which 
a  wide  range  of  beef  samples  were  evaluated  and  discussed. 
One  or  more  sessions  were  devoted  toward  demonstrating 
levels  of  each  attribute  under  study.  Although  the  training 
procedure  described  by  Cross  et  al.  (1978)  was  developed  for 
the  CS  technique,  this  procedure  (with  slight  modifications) 
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could  also  be  effectively  used  to  train  panelists  in  the  use 
of  LS  and  ME. 

Choice  of  the  Meat  System 

Tenderness  is  probably  the  most  important  single 
quality  attribute  of  meat  (Bailey,  1972).  Considerable 
current  research  has  focused  on  the  effects  of  various 
postmortem  processing  treatments  such  as  hot  boning,  delay 
chilling  and  electrical  stimulation  on  the  resulting 
tenderness  and  on  other  quality  attributes  of  cooked  beef. 
Traditionally,  carcasses  are  left  intact  after  slaughter 
with  muscles  attached  and  restrained  by  the  skeleton, 
chilled  for  18  to  48  h  at  2  to  3°C  and  then  fabricated  and 
sold  as  sides,  quarters,  wholesale  cuts,  subprimal  cuts  or 
boneless  cuts  (Cross,  1979). 

In  hot  boning  (HB),  muscles  or  muscle  systems  are 
removed  from  the  carcass  prior  to  chilling  (West,  1983). 
Because  HB  requires  the  chilling  and  storage  of  only  edible 
meat  and  not  excess  fat  and  bone,  potential  economic 
benefits  include:  savings  in  energy,  cooler  space,  labor, 
transportation  costs,  product  shrinkage  and  in-plant 
residence  time  (Kastner  et  al.,  1973;  Ray  et  al.,  1982; 
Kastner,  1983).  However,  prerigor  excision  and  rapid 
chilling  may  cause  toughening  (Kastner  et  al.,  1973;  Cross 
and  Tennent,  1980;  Berry  and  Cross,  1982;  Bowles  Axe  et  al., 
1983;  Lyon  et  al.,  1983).  Although  flavor  is  generally  not 
affected  by  HB  (Schmidt  and  Keman,  1974;  Kastner  and 
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Russell,  1975;  Dransfield  et  al.,  1976),  the  connective 
tissue  of  hot-boned  meat  is  sometimes  perceived  as  more 
abundant  than  that  in  comparable  meat  from  conventionally- 
boned  carcasses  (Berry  and  Cross,  1982;  Bowles  Axe  et  al., 
1983) . 

Delay  chilling  (DC)  is  an  early  postmortem  conditioning 
treatment  in  which  carcasses  are  held  at  elevated 
temperatures  (7  -  16°)  for  8  to  24  h  postmortem  (Cliplef  and 
Strain,  1976;  Smith  et  al.,  1979a;  Crouse  et  al.,  1983).  In 
several  studies  (Hostetler  et  al.,  1975;  Cliplef  and  Strain, 
1976;  Smith  et  al.,  1979a;  Lochner  et  al.,  1980),  meat  from 
delay-chilled  carcasses  was  more  tender  than  that  from 
conventionally-processed  carcasses.  Recent  reports  (Lochner 
et  al.,  1980;  Marsh  et  al.,  1980-81)  have  suggested  that  the 
enhanced  tenderness  of  delay-chilled  beef  was  primarily  due 
to  the  high  muscle  temperature  and  pH  (Marsh  et  al., 

1980-81)  maintained  during  the  first  2  to  4  h  postmortem. 

In  addition,  DC  may  also  improve  meat  flavor  (Cliplef  and 
Strain,  1976)  and  reduce  detectable  connective  tissue 
(Crouse  et  al.,  1983). 

Electrical  stimulation  (ES)  of  beef  involves  the 
application  of  an  electrical  current  soon  after  slaughter  to 
accelerate  the  rate  of  glycolysis  and  reduce  the  onset  of 
rigor  mortis  (Will  et  al.,  1979).  The  use  of  ES  immediately 
post  slaughter  may  prevent  toughening  of  hot-boned  meat 
(Gilbert  and  Davey,  1976;  Walker  et  al.,  1977;  Bouton  et 
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al.,  1980a;  Bowles  Axe  et  al.,  1983).  However,  several 
workers  (Taylor  et  al.,  1980-81;  Lyon  et  al.,  1983)  have 
found  that  ES  did  not  improve  the  tenderness  of  hot-boned 
meat . 

Frequently,  ES  has  been  shown  to  increase  the 
tenderness  of  conventionally-chilled  carcasses  (Bouton  et 
al.,  1978,  1980a,  1980b;  Smith  et  al.,  1979b;  Cross  and 
Tennent,  1980;  George  et  al.,  1980;  McKeith  et  al.,  1980, 
1981;  Riley  et  al.,  1983).  However,  ES  may  be  more 
effective  in  improving  the  tenderness  of  tougher  meat,  such 
as  that  from  forage-fed  beef  (Davis  et  al.,  1981;  Salm  et 
al.,  1981;  Schroeder  et  al.,  1982).  Improvements  in  flavor 
and  in  other  meat  quality  attributes  due  to  ES  have 
occasionally  been  reported  (Smith  et  al.,  1979b,  McKeith  et 
al.,  1980). 

There  is  limited  research  on  the  ES  of  beef  carcasses 
in  conjunction  with  DC.  However,  Smith  et  al.  (1979b)  and 
Elgasim  et  al.  (1981)  reported  that  ES  contributed  more  to 
increased  tenderness  of  the  meat  cuts  than  did  DC. 

Thus,  these  various  postmortem  processing  treatments 
(HB,  DC,  ES)  should  provide  meat  with  a  wide  range  in 
organoleptic  quality  useful  for  a  comparative  study  of 
sensory  evaluation  techniques.  In  addition,  the  meat 
quality  differences  attributable  to  postmortem  treatment  and 
determined  from  sensory  and  objective  measurements,  should 
be  of  a  magnitude  similar  to  those  found  in  current  research 
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practice . 

The  longissimus  (L)  muscle  is  a  popular  muscle  for 
studies  on  meat  quality.  However,  because  muscles  may  vary 
in  their  response  to  different  treatments  (Hawrysh  and  Berg, 
1975),  additional  information  is  needed  on  the  effects  of 
various  postmortem  processing  treatments  on  other  major 
bovine  muscles  such  as  the  semimembranosus  (SM), 
semi tendinosus  (ST)  and  biceps  femoris  (BF).  The  SM  is  a 
large,  easily  accessible  muscle  which  could  provide  a  number 
of  similar  steaks.  Because  the  SM  muscle  has  displayed  a 
slight  end-to-end  variation  in  tenderness  (Paul  and 
Bratzler,  1955;  Ginger  and  Weir,  1958;  Dransfield  and  Jones, 
1978),  sampling  from  the  same  anatomical  location  from  each 
muscle  is  important.  Balancing  of  steak  position  from 
adjacent  steaks  would  further  minimize  any  differences  in 
tenderness  due  to  position. 

Choice  of  the  Cooking  System 

Two  dry  heat  methods  described  in  the  literature  for 
cooking  SM  steaks  are  roasting  (Batcher  and  Deary,  1975; 
Berry  et  al.,  1977;  McKeith  et  al.,  1981)  and  broiling 
(Breidenstein  et  al.,  1968;  Dryden  et  al.,  1979;  Smith  et 
al.,  1982).  In  roasting,  heat  is  transmitted  to  the  meat  by 
convection,  either  by  normal  or  forced  air ,  in  a  closed 
preheated  oven  ( AMSA ,  1978).  The  meat  is  not  turned  during 

cooking.  In  broiling,  steaks  are  cooked  primarily  by 
radiant  energy  (Paul  and  Palmer,  i972).  The  heat  usually 
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radiates  from  one  direction,  so  the  meat  must  be  turned 
during  cooking  ( AMSA ,  1978). 

Although  broiling  more  closely  simulates  current 
consumer  practice,  Harrison  (1975)  recommended  roasting 
instead  of  broiling  for  tender  meat  cuts  as  precise  broiling 
conditions  are  difficult  to  maintain.  Radiation  intensity 
varies  within  and  among  broiler  units  (Paul  and  Palmer, 

1972;  Batcher  and  Deary,  1975).  In  addition,  the  air 
temperature  of  the  oven  can  be  influenced  by  air  movement  in 
the  laboratory  (Cover  et  al.,  1957)  and  may  not  reflect  the 
true  energy  output  of  the  unit  (Paul  and  Palmer,  1972).  In 
broiling,  the  investigator  relies  on  the  final  internal 
temperature  of  the  steaks  as  his  only  control  (AMSA,  1978). 

Several  workers  (Batcher  and  Deary,  1975;  Cross  et  al., 
1979)  have  compared  the  characteristics  of  beef  steaks 
cooked  by  roasting  and  broiling.  When  cooked  to  either  60° 
or  71°,  roasted  SM  steaks  were  scored  more  juicy  and  tender 
and  less  mealy  than  comparable  steaks  broiled  to  the  same 
temperature  (Batcher  and  Deary,  1975).  In  addition,  cooking 
losses  were  significantly  lower  for  roasted  than  broiled 
steaks  (Batcher  and  Deary,  1975),  which  would  result  in 
increased  edible  portion.  Other  advantages  of  roasting  over 
broiling  described  by  Batcher  and  Deary  (1975)  included: 
less  splatter,  lower  energy  consumption  and  minimal  effort 
and  attention  for  the  researcher.  During  preliminary  work, 
it  was  noted  that  roasted  3.8  cm  SM  steaks  were  also  more 
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even  in  doneness  than  broiled  steaks  (Shand  and  Hawrysh, 
unpublished) . 

Cross  et  al.  (1979)  found  that  either  roasting  or 
broiling  was  suitable  for  2.5  cm  thick  longissimus  steaks. 
Tenderness,  connective  tissue  amount  and  flavor  scores  for 
steaks  did  not  differ  significantly  between  cooking  methods. 
However,  broiled  steaks  were  less  juicy  and  had  greater 
cooking  losses  than  roasted  steaks  (Cross  et  al.,  1979). 

Cooking  methods  selected  for  meat  research  should 
maximize  the  differences  due  to  factors  under  investigation 
and  minimize  variability  due  to  cooking  (Mottram,  1981). 
Thus,  roasting  of  SM  steaks  appears  to  be  an  appropriate  and 
reproducible  cooking  method  to  use  in  a  comparative  study  of 
sensory  evaluation  procedures. 
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EXPERIMENTAL  PROCEDURE 


Experimental  Design  and  Statistical  Analysis 

Eighteen  trained  panelists  evaluated  the  quality 
attributes  of  semimembranosus  steaks  using  three  sensory 
evaluation  techniques  (category  scaling  (CS),  line  scaling 
(LS)  and  magnitude  estimation  (ME)).  Semimembranosus  (SM) 
steaks  were  evaluated  according  to  an  experimental  design 
which  involved  four  treatments,  nine  replications  (roasts) 
per  treatment,  three  steaks  per  roast,  three  phases  (time 
periods)  and  18  judges.  The  design  for  one  experimental 
replication  is  outlined  in  Figure  1. 

The  three  adjacent  steaks,  cut  from  each  of  nine  roasts 
per  treatment,  were  assigned  according  to  three  3x3  Latin 
square  designs  for  preparation  in  each  of  three  phases. 

Each  phase  involved  the  evaluation  of  three  steaks  taken 
from  each  of  the  three  roast  positions. 

To  minimize  the  effects  of  scale  carry  over,  learning 
and  motivational  factors,  panelists  utilized  each  sensory 
evaluation  technique  sequentially,  in  different 
predetermined  orders,  in  the  three  phases  of  the  study 
(Figure  1).  Within  each  phase,  samples  from  each  steak  were 
evaluated  by  the  18  trained  panelists  with  six  of  the  judges 
using  the  CS  procedure,  six  other  judges  using  the  LS 
procedure  and  the  remaining  six  judges  using  ME.  At  the 
completion  of  the  study,  each  judge  had  evaluated  one  of 
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Figure 
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three  adjacent  steaks  from  each  of  the  nine  roasts  per 
treatment  using  each  evaluation  technique. 

Sample  presentation  to  panelists  was  completely 
balanced  for  serving  order,  judge  and  replication  as 
described  by  Sidel  and  Stone  (1976).  However,  for  each 
phase  of  the  study,  serving  order  for  each  panelist  remained 
the  same  to  allow  panelist  evaluations  with  each  technique 
to  be  similar. 

To  balance  any  treatment  order  effects  for  each 
replication,  all  preparation,  sampling  and  objective 
measurements  made  on  the  steaks  were  performed  according  to 
complete  and  incomplete  Latin  square  designs. 

Arithmetic  means  were  calculated  for  CS  and  LS  data; 
geometric  means  were  calculated  for  ME.  Least  squares 
split-plot  analyses  of  variance  were  computed  for  the 
various  factors  under  investigation.  Sources  of  variation 
were  treatment  (n=4),  replications  (n=9),  phases  (n=3)  and 
steaks  (n  =  3)  and  for  taste  panel  data,  panelists  ( n  = 1 8 ) . 

For  all  analyses  with  more  than  one  observation  per  cell, 
means  were  computed  and  used  in  the  analyses.  Student- 
Newman-Keuls ?  multiple  range  test  (Steel  and  Torrie,  1980) 
was  used  to  establish  significant  differences  among 
treatments . 


Data  for  each  sensory  evaluation  technique  were 
analysed  separately.  The  magnitude  estimates  were 
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transformed  to  logarithms  for  data  analyses  unless  otherwise 
noted.  However,  it  was  not  necessary  to  normalize  ME  data 
as  described  by  Powers  et  al.  (1981)  because  panelist 
variation  was  removed  in  the  analyses  of  variance  procedures 
ut i 1 ized . 

Initially,  the  four  postmortem  treatments  evaluated 
were  included  in  all  analyses  of  sensory  and  objective  data. 
However,  two  treatments  had  few  significant  differences  and 
contributed  little  to  a  comparison  of  sensory  evaluation 
techniques,  the  main  interest  of  this  study.  Therefore, 
analyses  were  performed  on  data  from  only  two  (delay 
chilling  and  hot  boning)  of  the  four  postmortem  processing 
treatments.  The  sources  of  variation  and  valid  error  terms 
used  in  the  analysis  of  data  from  each  sensory  evaluation 
procedure  are  presented  in  Table  1 . 

Correlation  and  regression  analyses  were  performed  to 
assess  linear  and  other  mathematical  functions  for 
appropriate  sensory  data  between  each  of  the  three 
evaluation  techniques.  In  addition,  for  tenderness 
assessments,  correlation  and  regression  analyses  were 
performed  between  each  of  the  evaluation  techniques  and 
Warner  Bratzler  shear  data.  The  functions  evaluated 
included:  linear  (Y=a+bX) ,  power  (Y=aXn),  logarithmic 
(Y=a+blnX),  hyperbolic  (Y=a+b(1/X))  and  parabolic 
( Y=a+bX+cX2 ) .  For  evaluation  of  the  non-linear  functions, 
appropriate  transformations  of  the  data  were  made  and  linear 
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Table  1.  Sources  of  variation,  degrees  of  freedom  and  valid 
error  terms  for  the  analyses  of  variance  of  data 
from  each  sensory  evaluation  procedure. 


Degrees  of  Freedom1 

Source  of 

Two  Four 

Valid  Error 

Variation 

Treatments  Treatments 

Term 

Treatment  (T) 

1 

3 

TR 

Replication  (R) 

8 

8 

TR 

TR  (T) 

8 

24 

—  — 

Phase  (P) 

2 

2 

— 

Judge/Phase  (J/P)2 

15 

15 

RP+RJ/P+TRP+TRJ/P 

TP 

2 

6 

— 

TJ/P3 

15 

45 

RP+RJ/P+TRP+TRJ/P 

RP 

16 

16 

— 

RJ/P 

120 

120 

TRP+TRJ/P 

TRP 

16 

48 

— 

TRJ/P 

120 

360 

—  — 

Total 

323 

647 

’Actual  degrees  of  freedom  for  each  of  the  analyses  may  be 
reduced  because  of  missing  data. 


2Used  as  panelist  source  of  variation. 

3Used  as  treatment  by  panelist  source  of  variation. 
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regressions  were  computed.  Ail  correlations  and  regressions 
were  computed  across  treatments  rather  than  within 
treatments  to  allow  an  evaluation  of  relationships  over  the 
entire  range  of  the  data. 

For  the  three  evaluation  techniques,  the  ranked  data 
for  selected  criteria  were  converted  to  scores  according  to 
the  method  of  Fisher  and  Yates  (1949)  as  described  by 
Larmond  (1977)  and  then  submitted  to  analyses  of  variance. 
Mean  separation  of  ranks  was  done  using  Student-Newman- 
Keuls'  multiple  range  test  (Steel  and  Torrie,  1980). 

Meat  Used  for  the  Study 

Thirty-six  SM  roasts  of  similar  weight,  nine  from  each 
of  four  postmortem  processing  treatments,  were  obtained  from 
young  crossbred  steers  (average  age  of  15  months)  raised  at 
the  Agriculture  Canada  Research  Branch,  Lacombe,  Alberta. 
Details  of  breeding,  management  and  postmortem  handling  of 
the  animals  were  described  by  Jeremiah  et  al.  (1984). 

Four  postmortem  processing  treatments  were  applied  to 
the  carcass  sides:  delay  chilling  (DC),  delay  chilling  with 
electrical  stimulation  (DC-ES),  hot  boning  (HB)  and  hot 
boning  with  electrical  stimulation  (HB-ES). 

Carcass  sides  assigned  to  the  DC  treatments  were  held 
for  2  h  in  a  walk-in  meat  cooler  at  10-15°  before  being 
chilled  at  2°.  The  SM  muscles  from  the  sides  assigned  to 
the  HB  treatments  were  excised  at  40  min  postmortem. 
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Electrical  stimulation  of  delay-chilled  sides  and 
hot-boned  SM  muscles  followed  similar  procedures.  At  40  min 
postmortem,  sides  or  excised  muscles  were  electrically 
stimulated  (550  V,  AC  50-60  cycles  per  sec.  2  min  duration 
with  3  sec  on,  3  sec  off)  using  a  Best  and  Donovan  No.  2333 
hog  stunner  attached  to  a  HJ8015  Juno  multimeter. 

The  sides  or  excised  muscles  were  aged  for  six  days 
(2°).  A  roast,  taken  from  the  distal  end  of  each  SM  muscle, 
was  individually  vacuum-packed  in  a  barrier  bag  and  stored 
(-30°)  for  seven  months.  Frozen  roasts  were  transported  to 
the  Meat  Laboratory  at  the  Edmonton  Research  Station  and  cut 
into  three  3.8  cm  steaks,  starting  from  the  cut  surface 
(approximate  centre  of  the  muscle).  Care  was  taken  to  cut 
each  of  the  steaks  perpendicular  to  the  muscle  fiber 
direction.  Each  steak  was  weighed,  placed  individually  in  a 
polyethylene  bag,  sealed  with  a  twist  tie  and  drugstore 
wrapped  in  waxed  freezer  paper.  The  packaged  frozen  steaks 
were  stored  (-25°)  for  one  to  two  months.  The  remainder  of 
each  roast  was  wrapped  in  aluminum  foil,  placed  in  a 
polyethylene  bag  and  stored  (-25°)  for  later  chemical 
analyses . 

Prior  to  cooking,  each  steak  was  thawed  in  its 
packaging  for  3  h  at  22°  and  20  h  at  3°. 
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Cooking  Procedure 

Each  steak  was  placed  proximal  side  up  on  a  rack  in  an 
aluminum  roasting  pan  (41  x  29  x  5  cm).  The  steaks  were 
roasted  individually  to  an  internal  temperature  of  65°  in 
one  of  four  household  ovens  (Kenmore,  Mark  3)  maintained  at 
176°.  The  internal  temperature  of  each  steak  was  monitored 
by  two  copper-constantan  thermocouples  connected  to  a 
Honeywell  recording  potentiometer.  After  cooling  to  50°, 
each  cooked  steak  was  weighed,  wrapped  in  plastic  wrap  and 
refrigerated  (3°)  for  approximately  one  hour  to  facilitate 
sampling  of  the  meat. 

Sampling  Procedure 

The  sampling  pattern  for  sensory  and  objective  tests 
(Figure  2)  was  standardized  during  preliminary  work. 

Each  cooked  steak  was  cut,  parallel  to  muscle  fiber 
direction,  into  four  (steaks  1  and  2)  or  three  (steak  3) 

1.3  cm  thick  slices.  Steak  slices  were  cut  into  rectangular 
cores  (1.3  x  1.3  x  3.2  cm)  using  an  electric  meat  slicer 
(Berkel  model  1836)  and  a  double-bladed  scalpel.  After 
removing  the  browned  exterior  surface,  two  1.3  cm  cubes  were 
cut  from  the  centre  of  each  of  18  cores  for  each  of  the  18 
panelists  (Figure  2).  The  cubes  were  refrigerated  (4°)  on  a 
covered  plate  for  one  half  to  three  and  one  half  hours 
before  evaluation. 
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Proximal  surface  of  left  semimembranosus  steak 
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Cores  for  Warner  Bratzler  shear  measurements  and  press 
fluid  determinations  were  taken  from  positions  in  steaks  1 
and  2.  A  portion  (5  x  2  x  3.2  cm)  was  removed  from  each 
steak,  wrapped  securely  in  plastic  wrap  and  aluminum  foil 
and  used  for  Hunter  color  measurements. 

Objective  Measurements  on  Raw  Samples 

Fat  and  Moisture 

The  percentages  of  fat  (ether  extract)  and  moisture  in 
each  raw  roast  were  determined  by  the  methods  of  the 
Association  of  Official  Analytical  Chemists  (AOAC,  1980).  A 
sample  (100-200  g)  from  the  distal  end  of  each  roast  was 
thawed  (4°),  trimmed  of  connective  tissue  and  fat,  ground 
(30  sec)  to  an  homogeneous  paste,  freeze-dried  (20  h)  and 
reground.  For  fat  determinations,  duplicate  2  g  portions  of 
each  freeze-dried  sample  were  placed  on  a  Goldfisch 
extraction  apparatus  for  18  h  and  the  resulting  ether 
extract  weighed.  Moisture  was  determined  by  drying 
duplicate  2  g  portions  of  each  freeze-dried  sample  for  an 
additional  18  h  at  105°. 

PH 

A  Fisher  Accumet  Model  230  pH/ion  meter  used  to 
determine  the  raw  pH  of  each  roast.  A  thawed  20  g  sample 
from  each  roast,  free  of  fat  and  connective  tissue,  was 
blended  with  100  ml  distilled  water  (60  sec)  and  filtered 
into  two  beakers  to  give  duplicate  readings. 
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Drip  in  Thaw 

The  weight  of  drip  in  thaw  of  each  steak  was  calculated 
as  a  percentage,  based  on  the  weight  of  the  frozen  steak. 

Objective  Measurements  on  Cooked  Samples 

Cooking  Time 

The  cooking  time  required  for  each  steak  to  reach  an 
internal  temperature  of  65°  from  an  initial  temperature  of 
10°  was  determined.  The  cooking  time  of  each  steak  in  min 
per  100  g  raw  weight  was  also  calculated. 

Cooking  Losses 

Percentage  total,  volatile  and  drip  losses,  based  on 
the  weight  of  each  thawed  trimmed  steak  were  calculated. 

Warner  Bratzler  Shear 

Meat  cores  (1.3  x  1.3  x  3.2  cm)  were  removed  from 
specified  positions  on  steaks  1  and  2  of  each  roast, 
adjacent  to  those  used  for  sensory  evaluation  (Figure  2). 

The  cores  (21°)  were  sheared  once,  perpendicular  to  the 
fiber  direction,  with  a  Warner  Bratzler  shear  attachment  on 
the  Ottawa  Texture  Measuring  System  (OTMS)  (up  speed,  0.25 
cm/sec;  5  %  range  selection).  An  average  of  nine  shear 
values  were  obtained  for  each  steak. 
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Press  Fluid 

The  percentage  of  press  fluid  in  the  cooked  steaks  was 
determined  using  a  modification  of  the  method  of  Stanley  and 
Swatland  (1976).  Triplicate  0.5  g  samples,  taken  from 
steaks  1  and  2,  were  placed  individually  between  two  pieces 
of  preweighed  5  cm2  aluminum  foil.  Three  pieces  of  Whatman 
No.  1  filter  paper  (11  cm  in  diam)  were  placed  outside  each 
piece  of  foil.  Each  sample  was  pressed  separately  between 
two  plexiglass  plates  (60  sec;  878.8  kg/cm2  force)  using  a 
Carver  Laboratory  Press  (Fred  S.  Carver,  Inc.,  Summit,  New 
Jersey).  The  pressed  meat  and  foil  were  weighed  immediately 
after  pressing  to  determine  moisture  loss.  The  percentage 
of  press  fluid  was  calculated  by  dividing  the  weight  of  the 
expressed  liquid  by  the  weight  of  the  original  meat  sample. 

Color 


The  color  of  each  cooked  steak  was  measured  using  a 
Hunterlab  Color  Difference  Meter  (Model  D25A-2)  standardized 
against  a  white  tile  with  values  of  1=92.7,  a=-1.0  and 
6=0.3.  An  exposed  interior  surface  of  each  steak  (Figure  2) 
was  placed  under  the  instrument  port.  Lightness  ' L' , 
redness  'a'  and  yellowness  'b'  values  were  recorded  for  each 
sample.  Then,  each  sample  was  rotated  90°  and  a  second 
reading  was  made  for  each  color  difference  factor. 
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Degree  of  Doneness  of  Cooked  Samples 

During  sample  preparation,  the  internal  color  of  each 
cooked  steak  was  assessed  by  three  experienced  judges. 

After  each  steak  was  cut  into  slices  (Figure  2),  the  three 
judges  independently  scored  the  degree  of  doneness  of  the 
second  slice  using  a  double-pointed  category  scale  (^well- 
done,  5=medium,  9=rare)  (Appendix,  Figure  13,  page  132). 

Sensory  Methodology 

Eighteen  trained  panelists  evaluated  the  organoleptic 
quality  of  steaks  from  each  of  four  postmortem  processing 
treatments  using  category  scaling  (CS),  line  scaling  (LS), 
and  magnitude  estimation  (ME).  Panelists  assessed  five 
descriptive  attributes  in  each  beef  sample:  initial 
tenderness,  overall  tenderness,  connective  tissue  amount, 
overall  juiciness  and  flavor  intensity. 

Selection  of  Panelists 

Panelists  were  screened  by  a  procedure  similar  to  that 
used  by  Cross  et  al.  (1978).  Twenty-eight  students  and 
staff  in  the  Department  of  Foods  and  Nutrition,  University 
of  Alberta,  participated  in  a  series  of  16  triangle  tests. 
During  each  triangle  test,  panelists  were  asked  to  pick  the 
odd  sample  with  respect  to  tenderness,  juiciness  or  amount 
of  connective  tissue  and  to  indicate  the  degree  and 
direction  of  difference.  Panelists  correctly  identified  the 
odd  sample  73  %  of  the  time,  with  a  range  of  56-96  %  over 
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the  16  triangles.  Twenty-three  panelists  were  selected  for 
training  on  the  basis  of  their  ability  to  correctly  identify 
the  odd  sample,  interest  in  the  study  and  availability  for 
the  duration  of  the  study. 

Training  of  Panelists 

Training  sessions  were  held  two  to  three  times  per  week 
for  10  weeks.  The  first  session  introduced  panelists  to 
sensory  evaluation  and  acquainted  them  with  terms  used  for 
general  texture  description.  Panelists  ranked  standard 
foods  used  as  anchors  for  softness  and  juiciness  (Forbes, 
1973)  and  then  scored  the  softness  or  juiciness  of  these 
foods  on  eight-point  scales.  For  example,  foods 
representing  different  intensities  of  juiciness  included: 
shortcake  biscuits,  dehydrated  apples,  canned  potatoes, 
raisins  and  canned  mushroom  caps. 

During  training,  panelists  were  gradually  acquainted 
with  the  three  evaluation  techniques,  panel  procedures  and 
the  quality  attributes  under  investigation.  Definitions 
provided  for  each  of  the  descriptive  attributes  are  shown  in 
Figure  3.  The  CS  method  was  introduced  first.  Panelists 
were  asked  to  rate  each  attribute  in  the  meat  samples  using 
an  eight-point  desciptive  category  scale.  A  value  of  8 
represented  extremely  tender,  juicy,  meaty  and  no  connective 
tissue  and  a  value  of  1  represented  extremely  tough,  dry, 
weak  and  abundant  connective  tissue.  The  final  CS  scorecard 
^ 2  presented  in  Figure  4.  Each  panelist  standardized 
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INITIAL  TENDERNESS  - 


OVERALL  TENDERNESS  - 


CONNECTIVE  TISSUE  — 


OVERALL  JUICINESS  -- 


FLAVOR  INTENSITY  -- 


-  is  the  lack  of  force  required  to  bite 
through  a  cube  of  beef  across  the 
grain,  between  the  molar  teeth 
(evaluated  after  two  chews). 


-  is  the  amount  of  effort  and  time 
required  to  completely  masticate  a 
cube  of  beef. 


is  the  amount  of  residue  felt  during 
chewing  and  left  after  complete 
mastication . 


is  the  amount  of  moisture  left  in  the 
mouth  after  complete  mastication. 


is  the  amount  of  meaty  flavor  present 
in  the  mouth  after  complete 
mastication . 


Figure  3.  Quality  attribute  definitions  used  for 
the  sensory  evaluation  of  beef. 
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SAMPLE  # _  DATE _  NAME 


INITIAL  TENDERNESS 

(2  chews) 


8  extremely  tender 
7  very  tender 
6  moderately  tender 
5  slightly  tender 
4  slightly  tough 
3  moderately  tough 
2  very  tough 
1  extremely  tough 

CONNECTIVE  TISSUE  AMT. 


8  none 

7  practically  none 
6  traces 
5  slight 
4  moderate 
3  slightly  abundant 
2  moderately  abundant 
1  abundant 

FLAVOR  INTENSITY 


8  extremely  meaty 
7  very  meaty 
6  moderately  meaty 
5  slightly  meaty 
4  slightly  weak 
3  moderately  weak 
2  very  weak 
1  extremely  weak 

COMMENTS 


Figure  4.  Category  scale  scorecard  for  the  sensory 

evaluation  of  cooked  semimembranosus  steaks. 


OVERALL  TENDERNESS 

(#  chews  ) 


8  extremely  tender 
7  very  tender 
6  moderately  tender 
5  slightly  tender 
4  slightly  tough 
3  moderately  tough 
2  very  tough 
1  extremely  tough 

OVERALL  JUICINESS 


8  extremely  juicy 
7  very  juicy 
6  moderately  juicy 
5  slightly  juicy 
4  slightly  dry 
3  moderately  dry 
2  very  dry 
1  extremely  dry 
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his/her  overall  tenderness  scores  by  the  number  of  chews 
required  to  completely  masticate  a  cube  of  beef. 

During  the  second  week  of  training,  the  LS  technique 
was  introduced.  Each  panelist  was  instructed  to  mark  a 
vertical  line  across  a  15  cm  line  at  the  point  which  best 
described  his/her  impression  of  each  of  the  atributes  in  the 
beef  cubes.  The  line  scale  was  anchored  1.3  cm  from  each 
end  with  the  appropriate  descriptors  (very  tender--very 
tough;  small  amount  connective  t i ssue--large  amount 
connective  tissue;  very  juicy--very  dry;  very  meaty--very 
weak).  A  value  from  0.0  to  15.0  was  assigned  to  each  rating 
by  converting  the  mark  on  the  line  to  a  numerical  score. 

The  final  LS  scorecard  is  presented  in  Figure  5. 

Magnitude  estimation  was  introduced  during  the  last 
four  weeks  of  panel  training.  To  acquaint  panelists  with 
the  method  of  modulus  free  ME,  panelists  evaluated  shapes 
and  lines  as  described  by  Moskowitz  (1977).  Each  judge  then 
evaluated  the  quality  attributes  of  the  beef  samples  by 
assigning  any  number  (greater  than  0)  of  his/her  choice  to 
describe  each  attribute  in  the  first  sample  presented.  For 
each  successive  sample,  panelists  assigned  numbers  for  each 
attribute  in  relation  to  those  attributes  in  the  previous 
sample.  Panelists  were  instructed  that  in  ME,  the  ratio 
relationship  of  the  numbers  assigned  was  more  important  than 
the  actual  numbers  assigned.  Since  panelists  had  difficulty 
scoring  five  attributes  simultaneously,  the  final  ME 
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SAMPLE  # _  DATE _  NAME 


INITIAL  TENDERNESS 


very 

very 

tender 

tough 

OVERALL  TENDERNESS 


I - 

very 

- 1 - 

very 

tender 

tough 

CONNECTIVE  TISSUE  AMT. 


- 1 - 

small 

large 

amount 

amount 

OVERALL  JUICINESS 


very 

- 1 - 

very 

juicy 

dry 

FLAVOR  INTENSITY 

very 

very 

meaty 

weak 

COMMENTS 


Figure  5.  Line  scale  scorecard  for  the  sensory  evaluation 
of  cooked  semimembranosus  steaks. 
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scorecard  was  divided  into  two  parts  (Figure  6). 

The  introduction  and  refinement  of  each  sensory 
procedure  followed  a  similar  format.  Panelists  individually 
evaluated  samples  and  then  participated  in  a  round  table 
discussion.  The  group  leader  recorded  results  using  an 
overhead  projector  and  encouraged  participation  by  all 
panelists.  For  CS  and  LS,  actual  scores  were  recorded; 
however,  to  avoid  influencing  panelists'  choice  of  numbers 
when  using  ME,  only  relative  differences  between  meat 
samples  were  shown.  Discussion  of  results  and  comments  by 
panelists  during  these  sessions  helped  to  develop  panel 
consistency  and  to  improve  panelists’  understanding  of  the 
attributes  under  investigation.  Panelists  scored  a  wide 
variety  of  beef  samples  (different  muscles  from  animals  of 
various  ages  were  cooked  to  varying  degrees  of  doneness) 
using  each  evaluation  technique  (Table  2).  The  number  of 
attributes  and  samples  evaluated  per  session  was  increased 
gradually  for  each  procedure. 

The  last  two  weeks  of  training  were  arranged  to 
simulate  actual  testing  conditions.  The  samples  evaluated, 
taste  panel  booth  arrangement  and  the  scorecards  used  were 
similar  to  those  in  the  actual  study.  Panelists 
individually  evaluated  samples  in  a  taste  panel  room  and 
then  participated  in  a  brief  discussion  after  each  session. 
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DATE 


NAME 
CUBE  ONE 


SAMPLE  CODE 


INITIAL 

TENDERNESS 

( t#  =  t  toughness ) 

FLAVOR 

INTENSITY 

( f #=tmeat iness ) 

OVERALL 

JUICINESS 

( t#= t  juic i ness ) 


COMMENTS 


DATE 


NAME 

CUBE  TWO 


SAMPLE  CODE 


OVERALL 

TENDERNESS 

( f #=t toughness ) 

CONNECTIVE 
TISSUE  AMT. 

( t#=t amount ) 


COMMENTS 


Figure  6.  Magnitude  estimation  scorecard  for  the  sensory 
evaluation  of  cooked  semimembranosus  steaks. 
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Evaluation  of  Panelist  Performance 

Panelist  performance  was  evaluated  after  four  and  eight 
weeks  of  training  using  the  procedure  described  by  Cross  et 
al.  (1978)  and  the  AMSA  (1978).  The  first  evaluation  (four 
replications  of  six  treatments)  gave  a  general  indication  of 
the  panelists'  ability  to  discriminate  differences  in  the 
palatability  characteristics  of  the  samples  presented.  The 
F-value  from  a  one-way  analysis  of  variance  for  each 
palatability  characteristic  was  used  as  an  indicator  of  each 
panelist's  performance.  A  high  F-value  indicated  that  the 
panelist  was  consistent  in  making  duplicate  judgements  and 
could  discriminate  differences  in  the  treatments  presented 
for  evaluation.  Those  characteristics  (for  example,  amount 
of  connective  tissue)  that  panelists  had  difficulty  in 
differentiating  were  stressed  in  later  training  sessions. 

The  second  evaluation  of  panelist  performance  (four 
replications  of  eight  treatments)  was  used  primarily  to 
select  panelists  for  the  study.  Eighteen  panelists  (17 
females,  one  male;  18  to  35  years)  who  were  sensitive  to 
sample  differences,  consistent  over  time  and  highly 
motivated  were  chosen  to  participate  in  the  study.  Prior  to 
training,  16  of  the  18  were  naive  to  sensory  evaluation. 

Sample  Presentation 

Panel  sessions  were  held  daily  between  1030  h  and 
1430  h  in  an  atmospherically-controlled  sensory  panel  room 
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equipped  with  eight  individual  booths  and  white  lights. 

Each  panelist  received  two  1.3  cm  cubes  of  beef  from 
designated  positions  in  each  of  two  sets  of  four  steaks 
(which  represented  two  replications).  The  samples  (in  small 
covered  glass  jars)  had  been  warmed  to  50°  in  Corningware 
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Table  3. 


Assignment  of  the  three  evaluation 
o  panelists  during  each  phase2  of 


procedures' 
the  study. 


Panelist 

Phase 

Phase 

Phase 

1 

2 

3 

1  - 

3 

CS 

ME 

LS 

4  - 

6 

CS 

LS 

ME 

7  - 

9 

LS 

CS 

ME 

10  - 

12 

ME 

CS 

LS 

.13  - 

15 

LS 

ME 

CS 

16  - 

18 

ME 

LS 

CS 

’Category  scaling  (CS), 
estimation  (ME) . 


line  scaling  (LS)  and  magnitude 


2 Each  phase  involved  the  evaluation  of  nine  steaks  per 
treatment.  F 
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Individual  panelists'  scores  for  each  attribute  were 
tabulated  for  statistical  analysis.  If  data  were  missing 
for  an  attribute,  the  mean  value  of  that  panelist's  scores 
for  the  other  replications  of  that  treatment  was  utilized 
for  data  analysis. 

Panelists'  impressions  regarding  each  of  the  sensory 
evaluation  procedures  was  obtained  in  two  ways.  After 
completing  each  phase  of  the  study,  each  panelist  was  given 
an  open-ended  questionaire  to  record  his/her  impressions  of 
the  evaluation  technique  just  employed  (Appendix,  Figure  17, 
page  136).  For  each  method,  panelists  were  asked  to  comment 
on  ease  of  learning,  length  of  training  provided, 
applicability  for  the  sensory  evaluation  of  beef,  effort 
needed  for  sample  evaluation  and  on  scale  accuracy.  In 
addition,  panelists  were  asked  to  list  desirable  and 
undesirable  characteristics  of  each  evaluation  technique. 
Upon  completion  of  the  study,  the  panelists  individually 
ranked  the  three  evaluation  procedures  according  to  selected 
criteria  (Appendix,  Figure  18,  page  137). 

As  a  motivational  tool  and  to  show  appreciation  for  the 
taste  panelists'  participation,  small  treats  were  given 
after  each  panel  session.  Judges  also  received  a  small 
honorarium  at  the  end  of  the  study. 
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RESULTS  AND  DISCUSSION 


Because  the  electrical  stimulation  treatments  (delay 
chilling  with  electrical  stimulation  (DC-ES)  and  hot  boning 
with  electrical  stimulation  (HB-ES))  resulted  in  few 
significant  differences  and  contributed  little  to  a 
comparison  of  sensory  evaluation  techniques  for  the 
descriptive  sensory  assessment  of  meat,  the  "Results  and 
Discussion"  have  been  limited  to  a  discussion  of  data  for 
the  delay  chilling  (DC)  and  hot  boning  (KB)  treatments.  In 
this  and  subsequent  sections,  the  abbreviations  DC  and  HB 
also  indicate  delay  chilled  and  hot  boned,  respectively. 

For  most  of  the  tables  presented  in  the  text,  corresponding 
tables  presenting  data  for  the  four  treatments  appear  in  the 
Appendices  (Tables  16  to  23,  pages  138  to  145). 

Objective  Measurements 

Means  and  standard  errors  for  chemical  measurements  and 
cooking  data  for  steaks  from  the  DC  and  HB  treatments  are 
presented  in  Table  4.  Data  for  pH  and  for  percentages  of 
ether  extract  and  total  moisture  indicate  that  the  steaks 
from  both  treatments  were  similar.  The  semimembranosus  (SM) 
muscles  used  for  this  study  were  very  lean  with  an  average 
ether  extract  of  1.4  %.  Hawrysh  and  Berg  (1976)  reported  a 
similar  ether  extract  (1.7  %)  for  the  semi tendinosus  (ST) 
muscles  of  eight  Grade  A1  steers. 
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a  e  4.  Means  and  standard  errors  for  chemical  and  cooking 
data  for  semimembranosus  steaks  from  delay-chilled 
and  hot-boned  treatments. 


Postmortem  Treatment 

Measurement 

Delay-Chilled  Hot-Boned 

SEM 

Chemical  data2 

pH 

5.5 

5.6 

0.07 

Ether  extract,  % 

1  .3 

1.4 

0.21 

Total  moisture,  % 

73.2 

73.7 

0.30 

Cooking  data3 

Raw  weight,  g 

457.7 

449.6 

52.31 

Drip  in  thaw,  % 

5.6 

7.8 

0.20*** 

Ini t ial  temp. ,  °C 

4.8 

6.0 

1  .39 

Final  internal  temp.,  °C 

65.  1 

65.  1 

0.16 

Cooking  time,  min 

55.3 

58.4 

2.75 

Cooking  time,  min/IOOg 

12.2 

13.3 

0.99 

Cooking  losses,  % 

Total 

23.  1 

22.  1 

1  .29 

Volatile 

20.  1 

19.2 

1.01 

Drip 

3.0 

2.9 

0.72 

’Standard  error  of  the  mean. 

2Values  are  the  means  of  18  determinations,  2  per  replicate. 

3Values  are  the  means  of  27  determinations,  1  per  steak  per 
replicate . 

***  Significant  at  P<0.001. 
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The  raw  weight  of  steaks  from  the  DC  and  HB  postmortem 
treatments  was  similar  (Table  4).  However,  HB  steaks  had  a 
significantly  greater  percentage  drip  in  thaw  than 
comparable  DC  steaks.  Since  studies  comparing  the  thaw 
losses  of  DC  and  HB  treatments  are  lacking,  the  reason  for 
this  result  obtained  in  the  present  study  is  not  readily 
apparent.  Delay-chilled  and  hot-boned  treatments  generally 
have  similar  thaw  losses  to  their  conventionally-processed 
counterparts.  Thaw  losses  were  similar  for  longissimus 
steaks  from  DC  (16°  for  12  h)  and  control  treatments  (16° 
for  1  h,  then  2°)  (Crouse  et  al.,  1983).  Bowles  Axe  et  al. 
(1983)  observed  that  the  percentage  drip  in  thaw  of  SM 
muscles,  boned  2  h  postmortem  and  aged  for  either  24  h 
or  6  days,  was  similar  to  that  of  control  SM  muscles,  boned 
48  h  postmortem  and  aged  0  or  6  days.  Berry  and  Cross  (1982) 
also  reported  that  thaw  losses  of  SM  roasts  boned  2  h 
postmortem  did  not  differ  from  comparable  roasts  boned  at 
24  h  postmortem. 

There  was  no  significant  difference  in  the  final 
internal  temperature  attained  by  DC  and  HB  steaks  (Table  4). 
Steaks  were  removed  from  the  ovens  at  an  average  temperature 
of  65.1°  (range,  64.75°  to  65.50°)  and  did  not  exhibit  a 
post  oven  temperature  rise.  Using  a  modified  oven  roasting 
technique,  Moore  et  al.  (1980)  also  noted  that  SM  steaks 
cooked  to  65°  had  no  post  oven  temperature  rise. 
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The  average  cooking  times  (min  or  min  per  100  g)  and 
percentage  cooking  losses  for  steaks  showed  no  significant 
differences  attributable  to  postmortem  treatment.  In  the 
present  study,  steaks  took  an  average  total  cooking  time  of 
56.8  min  to  reach  65°  with  total,  volatile  and  drip  losses 
of  22.6  %,  19.6  %  and  3.0  %,  respectively.  Moore  et  al. 
(1980)  cooked  3.8  cm  SM  steaks  in  a  rotary  gas  oven  (177°) 
to  65°  in  55.2  min  and  obtained  total,  volatile  and  drip 
losses  of  20.6  %,  19.0  %  and  1.6  %,  respectively.  Batcher 
and  Deary  (1975)  roasted  3.8  cm  SM  steaks  to  either  60°  or 
71°  and  reported  total  cooking  losses  of  18.4  %  and  26.2  %, 
respectively . 

Few  studies  have  compared  the  cooking  losses  of  DC  and 
HB  treatments.  However,  Jeremiah  et  al.  (1984),  using 
similar  meat  and  the  same  postmortem  treatments  as  those  in 
the  present  study,  reported  that  cooking  losses  for  DC  and 
HB  SM  muscles  were  similar  (Table  5).  In  addition,  Jeremiah 
et  al.  (1984)  found  no  significant  differences  in  the 
cooking  losses  of  longissimus  (L)  and  triceps  brachii  (TB) 
muscles  attributable  to  postmortem  treatment. 

Most  researchers  (Joseph  and  Connolly,  1977;  Smith  et 
al.,  1979a,  1979b;  Crouse  et  al.,  1983;  Jeremiah  et  al., 
1984)  have  reported  that  cooking  losses  for  DC  meat  cuts  are 
similar  to  those  of  their  conventionally-chilled 
counterparts.  However,  Smith  et  al.  (1979a)  observed  that 
delay-chilled  SM  steaks  had  significantly  greater  cooking 
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Table  5.  Means  for  objective  and  sensory  data  for 

semimembranosus  steaks  from  delay-chilled  and 
hot-boned  treatments  (adapted  from  Jeremiah  et  al., 


Attribute 

Postmortem 

Delay-Chilled 

Treatment 

Hot-Boned 

Objective  data 

Total  cooking 
losses,  % 

31.0 

±  0 . 92 1 

30.2  ± 

0.81 

Warner  Bratzler 
shear,  kg 

6.7 

±  0.24 

8.0  ± 

0.45 

* 

Sensory  data2 

Initial  tenderness 

4.  1 

±  0.21 

3.2  ± 

0.25 

* 

Overall  tenderness 

4.  1 

±0.17 

3.2  ± 

0.20 

* 

Connective  tissue 

4.4 

±0.15 

3.7  ± 

0.18 

* 

Juiciness 

4.3 

±0.17 

4.7  ± 

0.18 

Flavor  intensity 

5.2 

±  0.09 

5.3  ± 

0.08 

’Standard  error. 

2Eight-point  category  scales.  Higher  values  indicate 
increased  tenderness,  juiciness,  flavor  intensity  and 
decreased  connective  tissue  perceptibility. 

*  Significant  at  P<0.05. 
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losses  than  conventionally-chilled  SM  steaks. 

Generally,  the  cooking  losses  of  hot-boned  SM  muscles 
(cooked  postrigor)  do  not  differ  significantly  from  muscles 
boned  at  24  h  postmortem  or  later  (Kastner  et  al.,  1973; 
Follett  et  al.,  1974;  Berry  and  Cross,  1982;  Bowles  Axe  et 
al.,  1983;  Jeremiah  et  al.,  1984). 

Table  6  summarizes  means  and  standard  errors  for  the 
degree  of  doneness  and  for  objective  measurements  of  cooked 
SM  steaks.  The  visual  evaluation  of  steak  doneness  by  a 
three  member  panel  indicated  that  the  HB  steaks  were  more 
rare  ( P<0 .01)  than  comparable  DC  steaks.  However,  steaks 
from  both  treatments  had  reached  the  same  average  internal 
temperature  of  65.1°  (Table  4,  page  56). 

Hunter  color  and  press  fluid  determinations  on  the 
cooked  DC  and  HB  steaks  supported  these  panel  results  which 
showed  a  significant  difference  in  steak  doneness  between 
the  two  postmortem  treatments.  Although  Hunter  L  and  b 
values  for  both  treatments  were  similar,  the  Hunter  a 
(redness)  value  for  HB  steaks  was  higher  (P<0.001)  than  that 
for  comparable  DC  steaks.  In  addition,  a  significantly 
greater  percentage  of  press  fluid  was  released  from  HB 
steaks  than  from  DC  steaks  (Table  6). 

Although  the  HB  steaks  in  the  present  study  took 
slightly  longer  to  cook  than  comparable  DC  steaks  (Table  4, 
page  56),  the  difference  was  not  statistically  significant. 
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Table  6.  Means  and  standards  errors  for  the  degree  of 

doneness  evaluation  and  objective  measurements  for 
semimembranosus  steaks  from  delay-chilled  and 
hot-boned  treatments. 


Measurement 

Postmortem 

Delay-Chilled 

Treatment 

Hot-Boned 

SEM1 

Degree  of  doneness2 

4.9 

5.6 

0 .  14** 

Hunter  L 3 

44.9 

42.6 

1  .93 

a 

6.8 

8.4 

0.09*** 

b 

10.4 

10.5 

0.13 

Press  fluid,  % 

36.5 

40.  1 

0.61** 

OTMS-Warner 

Bratzler  shear,  kg 

7.7 

8.7 

0.66 

1  Standard  error  of  the  mean. 

2 Doneness  scale:  1=well-done,  5=medium,  9=rare.  Values  are 
the  means  of  81  judgements,  one  per  steak  by  each  of 
three  panelists. 

3Values  are  the  means  of  54  determinations,  2  on  each  of 
three  steaks  per  replication. 

4Values  are  the  means  of  54  determinations,  3  on  each  of  two 
steaks  per  replication. 

5Values  are  the  means  of  156  determinations,  with  an  average 
of  9  shears  on  each  of  two  steaks  per  replication. 

**,  ***  Significant  at  P<0.01  and  P<0.001,  respectively. 
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There  is  a  lack  of  evidence  in  the  literature  to  indicate 
that  changes  in  the  rate  of  heat  penetration  of  this 
magnitude  (approximately  5  %)  would  produce  observable 
differences  in  the  degree  of  doneness.  Several  workers 
(Shaffer  et  al.,  1973;  Batcher  and  Deary,  1975;  Vollmar  et 
al.,  1976;  McDowell  et  al.,  1982)  have  shown  that  when  rates 
of  heat  penetration  between  cooking  methods  vary 
c on s i ae r ably ,  differences  in  degree  of  doneness  are 
sometimes  noted. 

The  difference  in  doneness,  Hunter  a  values  and 
percentages  of  press  fluid  between  the  DC  and  HB  steaks  in 
the  present  study  (Table  6)  may  have  been  due  to  postmortem 
processing  treatment.  However,  comparisons  of  data  for 
these  particular  measurements  (degree  of  doneness,  Hunter 
color  and  water-holding  capacity)  in  cooked  DC  and/or  HB 
meat  are  lacking. 

Occasionally,  raw  HB  meat  has  been  found  to  be  darker 
in  color  than  conventionally-boned  beef  (Cross  and  Tennent, 
1980;  Claus,  1982).  Jeremiah  et  al.  (1984),  using  similar 
SM  steaks  and  the  same  postmortem  treatments  as  the  present 
study,  reported  that  HB  steaks  were  darker,  as  determined 
instrumentally  and  subjectively,  than  comparable  DC  steaks. 
The  water-holding  capacity  of  raw  meat  cuts  generally  is  not 
influenced  by  either  DC  (Cliplef  and  Strain,  1976)  or  HB 
(Cross  and  Tennent,  1980).  However,  Kastner  et  al.  (1973) 
observed  that  the  amount  of  press  fluid  from  raw  meat  was 
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greater  for  meat  cuts  excised  at  5  and  8  h  postmortem  than 
for  control  cuts  excised  at  48  h.  These  findings  for 
postmortem  effects  on  color  and  water-holding  capacity  in 
raw  meat  are  interesting.  However,  conclusions  about  these 
postmortem  effects  for  cooked  meat  are  not  possible  since 
limited  information  is  available  on  the  relationship  between 
cooked  and  raw  meat  characteristics. 

Although  shear  force  values  for  HB  steaks  were  higher 
than  for  comparable  DC  steaks,  the  difference  was  not 
statistically  significant  (Table  6).  Studies  comparing  DC 
and  HB  treatments  are  lacking.  Using  steaks  similar  to 
those  in  the  present  study,  Jeremiah  et  al.  (1984)  reported 
that  HB  steaks  had  significantly  higher  shear  force  values 
than  the  DC  steaks  (Table  5,  page  59). 

In  several  studies  (Joseph  and  Connolly,  1977;  Smith  et 
al.,  1979a,  1979b;  Jeremiah  et  al.,  1984),  delay-chilled  SM, 
ST  and  biceps  femoris  (BF)  muscles  received  similar  shear 
values  to  comparable  conventionally-chilled  cuts.  The 
effect  of  DC  on  the  tenderness  of  the  L  muscle  has  been 
inconsistent.  Several  workers  (Smith  et  al.,  1979a,  1979b; 
Crouse  et  al.,  1983)  have  found  no  difference  in  shear 
values  between  DC  and  control  L  muscles.  However,  Cliplef 
and  Strain  (1976)  and  Lochner  et  al.  (1980)  reported  that 
delay-chilled  L  muscles  were  significantly  more  tender  than 
comparable  conventionally-chilled  muscles.  In  the  study  by 
Hostetler  et  al.  (1975),  a  general  improvement  in  tenderness 
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by  DC  was  observed;  however,  data  were  not  presented  to 
indicate  the  effects  of  delay  versus  conventional  chilling 
on  individual  muscles. 

Generally,  the  literature  indicates  that  hot-boned 
muscles  of  the  round  (SM,  ST,  BF )  cooked  post-rigor  are 
slightly  tougher  than  their  conventionally-boned 
counterparts  with  the  differences  occasionally  reaching 
statistical  significance  (Schmidt  and  Gilbert,  1970;  Kastner 
et  al.,  1973;  Schmidt  and  Keman,  1974;  Falk  et  al.,  1975; 
Kastner  and  Russell,  1975;  Bowles  Axe  et  al.,  1983). 

Sensory  Assessments 

Data  for  the  descriptive  sensory  evaluation  of  DC  and 
HB  steaks  obtained  from  18  trained  panelists  using  category 
scaling  (CS),  line  scaling  (LS)  and  magnitude  estimation 
(ME)  are  summarized  in  Table  7.  For  the  CS  and  LS 
evaluation  techniques,  higher  values  indicate  increased 
tenderness,  juiciness,  flavor  intensity  and  decreased 
connective  tissue  amount.  For  ME,  higher  values  indicate 
increased  toughness,  connective  tissue  amount,  juiciness  and 
flavor  intensity. 

When  the  trained  panel  employed  CS ,  the  HB  steaks  were 
rated  lower  (P<0.05)  in  initial  tenderness  than  comparable 
DC  steaks  (Table  7).  For  panelist  evaluations  by  both  LS 
and  ME,  HB  steaks  tended  to  be  less  tender  than  comparable 
DC  steaks,  although  the  differences  were  not  statistically 
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able  7.  Means  and . standards  errors  for  category  scaling, 
line  scaling  and  magnitude  estimation  of  semi¬ 
membranosus  steaks  from  delay-chilled  and  hot¬ 
boned  treatments. 


Postmortem  Treatment 


Attribute 

Delay-Chilled 

Hot-Boned 

SEM1 

Initial  tenderness 

Category  scaling2 

5.4 

4.5 

0.26* 

Line  scaling3 

9.8 

8.2 

0.50 

Magnitude  estimation4 

13.7 

15.5 

(0.034) 5 

Overall  tenderness 

Category  scaling 

5.  1 

4.2 

0.23* 

Line  scaling 

8.9 

7.6 

0.42 

Magnitude  estimation 

16.3 

20.0 

(0.023)* 

Connective  tissue  amount 

Category  scaling 

4.9 

4.6 

0.12 

Line  scaling 

9.7 

9.2 

0.36 

Magnitude  estimation 

10.0 

11.5 

(0.023) 

Overall  juiciness 

Category  scaling 

4.7 

5.0 

0 .05** 

Line  scaling 

7.7 

8.3 

0.25 

Magnitude  estimation 

12.3 

14.4 

(0.014)** 

Flavor  intensity 

Category  scaling 

5.0 

5.0 

0.10 

Line  scaling 

7.9 

8.0 

0.23 

Magnitude  estimation 

13.9 

14.4 

(0.012) 

’Standard  error  of  the  mean. 


2Maximum  score=8.  Higher  values  indicate  increased 
tenderness,  juiciness,  flavor  intensity  and  decreased 
connective  tissue  amount. 

3Maximum  score=15.  Higher  values  indicate  increased 
tenderness,  juiciness,  flavor  intensity  and  decreased 
connective  tissue  amount. 

4Modulus  free  scoring.  Higher  values  indicate  increased 
toughness,  connective  tissue  amount,  juiciness  and  flavor 
intensity . 

5Standard  error  expressed  as  log  10  value. 

*,  **  Significant  at  P<0.05  and  P<0.01,  respectively. 
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significant.  However,  the  treatment  difference  for  the 
evaluation  of  initial  tenderness  by  the  LS  technique 
approached  significance  (P=0.056). 

Using  an  eight-point  category  scale  similar  to  that 
employed  in  the  present  study,  Jeremiah  et  al.  (1984)  found 
that  SM  steaks  from  the  HB  treatment  were  significantly 
lower  in  initial  tenderness  than  steaks  from  the  DC 
treatment  (Table  5,  page  59).  In  addition,  the  difference 
in  initial  tenderness  of  0.9  units  between  the  DC  and  HB 
treatments  reported  by  Jeremiah  et  al.  (  1984  )  was  the  same 
as  that  obtained  for  similar  steaks  in  the  present  study. 
However,  the  actual  scores  reported  by  Jeremiah  et  al. 

(1984)  were  lower  than  the  means  obtained  in  the  present 
study.  Differences  in  cooking  procedure  may  have 
contributed  to  this  result.  Jeremiah  et  al.  (1984)  roasted 
2.5  cm  SM  steaks  in  a  convection  oven  (177°)  to  an  internal 
temperature  of  75  ±  3°;  in  the  present  study,  3.8  cm  steaks 
were  roasted  in  conventional  ovens  (176°)  to  65°. 

Data  for  overall  tenderness  (Table  7)  show  that  for  CS 
and  ME,  HB  steaks  were  less  tender  (P<0.05)  than  comparable 
DC  steaks.  When  panelists  employed  LS ,  the  overall 
tenderness  of  HB  steaks  was  also  lower  than  that  of 
comparable  DC  steaks,  but  the  difference  did  not  reach 
statistical  significance  (P=0.054). 

In  this  study,  sensory  assessments  of  tenderness  tend 
to  be  supported  by  shear  force  data  (Table  6).  Although  the 


. 


' 


■ 


■ 


67 


difference  was  not  statistically  significant,  the  average 
shear  force  value  for  HB  steaks  was  higher  than  that  of 
comparable  DC  steaks. 

Data  from  Jeremiah  et  al.  (1984)  for  overall  tenderness 
(Table  5,  page  59)  are  consistent  with  the  CS  and  ME  results 
of  the  present  study.  Using  an  eight-point  category  scale, 
Jeremiah  et  al.  (1984)  obtained  significantly  lower  overall 
tenderness  scores  for  HB  steaks  than  for  comparable  DC 
steaks.  The  magnitude  of  the  tenderness  difference  (0.9 
units)  between  DC  and  HB  steaks  in  the  study  by  Jeremiah  et 
al.  (1984)  is  the  same  as  that  obtained  for  similar  SM 
steaks  in  the  present  study.  As  noted  earlier  for  initial 
tenderness,  variations  in  the  cooking  procedures  employed  by 
Jeremiah  et  al.  (1984)  and  in  the  present  study  probably 
account  for  differences  in  the  actual  tenderness  data 
obta i ned . 

In  several  studies  (Joseph  and  Connolly,  1977;  Smith  et 
al.,  1979b;  Jeremiah  et  al.,  1984),  trained  panelists,  using 
eight-  or  nine-point  category  scales,  detected  no 
significant  differences  in  overall  tenderness  between  delay- 
and  conventionally-chilled  SM  muscles. 

Schmidt  and  Keman  (1974)  obtained  similar  tenderness 
scores  (nine-point  hedonic  scale)  for  SM  muscles  excised  at 
1  and  24  h  postmortem.  Results  of  duo  trio  tests  (Will  et 
al.,  1976)  indicated  that  SM  muscles  boned  at  3,  5  or  7  h 
postmortem  did  not  differ  significantly  from  control  muscles 
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boned  at  48  h.  Using  eight-point  category  scales,  Berry  and 
Cross  (1982)  found  that  SM  roasts  boned  3  h  postmortem  were 
similar  in  overall  tenderness  to  cuts  boned  at  24  h. 

However,  braised  SM  steaks  from  the  HB  treatment  were 
significantly  tougher  than  comparable  SM  steaks  from  the 
conventionally-boned  treatment  (Berry  and  Cross,  1982). 

In  a  recent  study  (Bowles  Axe  et  al.,  1983),  panelists 
rated  hot  boned  SM  steaks  (aged  24  h)  significantly  lower  on 
an  eight-point  tenderness  scale  than  comparable  control  cuts 
(aged  48  h) .  However,  Bowles  Axe  et  al.  (1983)  found  that 
with  six  days  of  aging,  HB  and  control  cuts  were  similar  in 
tenderness.  In  contrast,  Jeremiah  et  al.  (1984)  reported 
that  L,  TB,  and  SM  cuts  excised  40  min  postmortem  and  aged  6 
days  were  tougher  than  their  conventionally-boned 
counterparts . 

Using  CS ,  LS  and  ME,  panelists  did  not  detect 
significant  differences  in  the  amount  of  connective  tissue 
between  comparable  steaks  from  the  DC  and  HB  postmortem 
treatments  (Table  7).  However,  trends  for  CS  and  ME  data 
(P=0.07  and  P=0.08,  respectively)  indicate  that  more 
connective  tissue  was  detected  in  the  HB  steaks  than  in 
comparable  DC  steaks.  Using  procedures  similar  to  the 
present  study,  Jeremiah  et  al.  (1984)  observed  that 
hot-boned  SM  steaks  had  significantly  more  detectable 
connective  tissue  than  either  control  or  delay-chilled  SM 
steaks.  Semimembranosus  steaks  from  either  a  DC  treatment 
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(Smith  et  al.,  1979b)  or  a  HB  treatment  (Bowles  Axe  et  al., 
1983)  received  connective  tissue  scores  (eight-point  scale) 
similar  to  their  conventionally-processed  counterparts. 

Berry  and  Cross  (1982)  also  used  eight-point  category  scales 
and  reported  that  SM  roasts  boned  3  h  postmortem  were 
similar  in  amount  of  detectable  connective  tissue  to  cuts 
boned  at  24  h.  However,  braised  SM  steaks  from  the  HB 
treatment  had  significantly  more  connective  tissue  than 
comparable  conventionally-boned  SM  steaks  (Berry  and  Cross, 
1982) . 

When  panelists  employed  CS  and  ME,  they  scored  the 
juiciness  of  HB  steaks  significantly  higher  ( P< 0 .01)  than 
that  of  comparable  DC  steaks  (Table  7).  In  addition,  with 
the  LS  technique,  HB  steaks  tended  (P=0.12)  to  be  juicier 
than  comparable  DC  steaks.  Since  steaks  from  the  DC  and  HB 
treatments  attained  the  same  average  internal  temperature  of 
65.1°  (range,  64.75°  to  65.5°)  (Table  4,  page  56),  juiciness 
differences  were  not  expected.  However,  the  significant 
differences  for  juiciness  (Table  7)  are  supported  by 
objective  measurements  (Table  6).  Visual  evaluation  of 
steak  doneness  indicated  that  the  HB  steaks  were  more  rare 
(P<0.01)  than  comparable  DC  steaks.  Although  Hunter  L  and  b 
values  for  both  treatments  were  similar,  the  Hunter  a 
(redness)  value  for  HB  steaks  was  higher  ( P< 0.001 )  than  that 
of  comparable  DC  steaks.  In  addition,  a  greater  ( P< 0.01) 
percentage  of  press  fluid  was  released  from  HB  steaks  than 
from  comparable  DC  steaks. 
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Few  studies  have  compared  the  juiciness  of  meat  from 
the  DC  and  HB  postmortem  treatments.  Jeremiah  et  al. 

(1984),  using  an  eight-point  category  scale,  found  that  the 
juiciness  of  comparable  DC  and  HB  steaks  was  similar  (Table 
5,  page  59).  Delay-chilled  and  HB  meat  cuts  generally  have 
similar  juiciness  scores  to  their  conventionally-processed 
counterparts  (Smith  et  al.,  1979b;  Bowles  Axe  et  al.,  1983; 
Crouse  et  al.,  1983;  Crouse  and  Seideman,  1984;  Jeremiah  et 
al.,  1984).  However,  in  the  study  by  Schmidt  and  Keman 
(1974),  SM  steaks  from  the  HB  treatment  tended  to  be  more 
juicy  than  comparable  steaks  from  the  control  treatment. 
Although  the  difference  was  not  statistically  significant, 
juiciness  means  for  the  HB  and  control  treatments  were  5.1 
and  4.1,  respectively,  on  a  nine-point  scale  (Schmidt  and 
Keman ,  1  974  )  . 

In  the  present  study,  the  difference  in  juiciness 
between  the  HB  and  DC  steaks  was  very  small  and  may  not  be 
of  practical  importance.  However,  it  is  interesting  to  note 
that  for  both  the  CS  and  ME  procedures,  the  panelists  were 
sensitive  (P<0.01)  to  these  small  differences  in  juiciness. 

Employing  each  of  the  three  evaluation  techniques, 
trained  panelists  detected  no  differences  in  flavor 
intensity  attributable  to  postmortem  processing  treatment 
(Table  7).  Jeremiah  et  al.  (1984)  also  reported  that  flavor 
scores  for  HB  and  DC  steaks  were  not  significantly  different 
(Table  5,  page  59).  Flavor  intensity  and/or  flavor 
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desirability  of  either  DC  or  HB  SM  muscles  is  usually 
similar  to  that  of  conventionally  processed  SM  cuts  (Schmidt 
and  Reman,  i974;  Smith  et  al.,  1979b;  Bowles  Axe  et  al., 
1983;  Jeremiah  et  al.,  1984). 

Results  (Table  7)  for  the  descriptive  sensory 
assessment  of  the  quality  attributes  of  DC  and  HB  steaks, 
show  that  evaluations  by  the  18  trained  panelists  were 
consistent  among  the  sensory  evaluation  techniques. 

Although  there  were  differences  in  statistical  significance, 
the  direction  and  extent  of  the  difference  between  mean 
scores,  assigned  to  each  of  the  quality  attributes  in  the  DC 
and  HB  steaks,  were  generally  similar  for  CS,  LS  and  ME. 
Panelist  assessments  by  each  of  the  three  sensory  evaluation 
procedures  indicated  that  HB  steaks  were  tougher,  more  juicy 
and  tended  to  have  more  detectable  connective  tissue  than 
comparable  DC  steaks.  Using  CS,  LS  and  ME,  panelists 
assigned  scores  indicating  similar  flavor  intensity  to 
steaks  from  both  postmortem  treatments.  In  addition,  for 
each  evaluation  technique,  differences  between  steak 
treatment  means  tended  to  be  large  for  initial  tenderness 
and  overall  tenderness,  intermediate  for  amount  of 
connective  tissue  and  juiciness,  and  slight  for  flavor 
intensity . 

Distribution  of  Tenderness  Data 

Distributions  of  the  overall  tenderness  assessments  by 
the  18  panelists  for  the  DC  and  HB  postmortem  treatments  are 
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illustrated  in  Figures  7  and  8.  For  evaluations  of  overall 
tenderness  with  CS,  the  frequency  of  usage  of  each  category 
was  plotted  for  each  of  the  two  postmortem  treatments. 
Frequency  of  usage  of  appropriate  scale  intervals  for 
assessments  with  LS  and  ME  was  also  plotted.  For  each 
postmortem  treatment,  eight  intervals  were  plotted  for  CS, 
16  for  LS,  20  for  ME  (real  numbers)  and  17  for  ME 
( logarithms ) . 

For  the  CS  technique  (Figure  7a),  the  entire 
eight-point  scale  was  used  for  overall  tenderness 
assessments  with  no  apparent  avoidance  of  the  scale 
extremes.  Steaks  from  the  HB  treatment  tended  to  receive 
lower  tenderness  scores  than  comparable  samples  from  the  DC 
treatment.  This  finding  supports  results  from  the  analysis 
of  variance  of  these  data  which  showed  a  significant 
difference  in  overall  tenderness  attributable  to  postmortem 
treatment . 

For  continuous  distributions,  the  Kolmogorov-Smi rnov 
goodness-of-f i t  test  (Steel  and  Torrie,  1980)  has  been  used 
to  test  whether  or  not  the  data  follow  a  normal 
distribution.  However,  because  CS  data  is  from  eight 
discrete  categories,  this  test  was  not  considered 
appropriate  (Keeping,  1962). 

Overall  tenderness  responses  for  LS  for  the  two 
postmortem  treatments  were  evenly  distributed  along  most  of 
the  scale  length  (Figure  7b).  However,  for  each  postmortem 
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igure  7.  Frequency  of  panelists’  usage  of  (a)  the 

eight-point  category  scale  (N=310)  and  (b)  the 
cm  line  scale  (N=322)  for  the  assessment  of 
overall  tenderness  in  delay-chilled  (•)  and 
hot-boned  (■)  steaks. 
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MAGNITUDE  ESTIMATION  (real  numbers) 


MAGNITUDE  ESTIMATION  (logarithms) 


■ost  tender  Scale  Range  ■ost  tough 


Figure  8.  Frequency  of  panelists'  usage  of  values  for 

magnitude  estimation  (N=312)  (a)  before  and  (b) 
after  logarithmic  transformation  of  the  assessment 
of  overall  tenderness  in  delay-chilled  (•)  and 
hot-boned  (■)  steaks. 
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treatment,  few  panelists  provided  ratings  near  the  ends  of 
the  line  scale.  The  descriptive  anchors  (located  1.3  cm 
from  each  end  of  the  scale)  (See  Figure  5,  page  47)  may  have 
constrained  panelists'  use  of  the  scale  extremes.  As 
determined  for  CS,  HB  steaks  tended  to  receive  lower 
tenderness  scores  than  comparable  DC  steaks.  However,  in 
the  analysis  of  variance  of  overall  tenderness  data  for  LS , 
this  difference  in  overall  tenderness  between  HB  and  DC 
steaks  did  not  reach  statistical  significance. 

The  frequency  of  panelists'  usage  of  numerical  values 
for  the  ME  assessment  of  overall  tenderness  in  steaks  from 
DC  and  HB  is  depicted  in  Figure  8a.  For  each  postmortem 
treatment,  the  ME  responses  are  very  skewed  to  the  left.  A 
logarithmic  transformation  of  the  tenderness  data  (Figure 
8b)  changed  the  distributions  of  ME  responses  from 
log-normal  to  approximately  normal.  Some  points  along  the 
ME  scale  were  more  frequently  used  due  the  panelists'  choice 
of  certain  favored  numbers  (10,  15,  20).  Thus  the 
transformed  ME  distributions  for  the  two  postmortem 
treatments  remained  somewhat  jagged.  Overall  tenderness 
distributions  for  the  ME  responses  (Figure  8a  and  8b)  show 
that  steaks  from  the  HB  treatment  tended  to  be  less  tender 
than  comparable  steaks  from  the  DC  treatment.  These  data 
support  results  of  the  analysis  of  variance  of  ME  overall 
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In  general,  panelist  responses  (Figures  7  and  8)  by 
each  evaluation  technique  show  that  the  HB  steaks  tended  to 
be  less  tender  than  comparable  DC  steaks.  For  each 
postmortem  treatment,  panelist  assessments  of  overall 
tenderness  with  CS  and  LS  were  more  evenly  distributed  than 
corresponding  overall  tenderness  assessments  with  ME. 

Sources  of  Variation  in  the  Statistical  Analyses 

The  magnitude  and  significance  of  F-values  for  selected 
sources  of  variation  from  the  split-plot  analyses  of 
variance  of  category  scale,  line  scale  and  magnitude 
estimation  data  were  examined.  The  valid  error  terms  used 
for  testing  the  significance  of  these  sources  of  variation 
were  shown  in  Figure  1,  page  31. 

Treatment  F-Values 

Treatment  F-values  (Table  8)  were  used  as  a  measure  of 
the  sensitivity  of  each  evaluation  technique  to  differences 
in  palatability  due  to  postmortem  treatment.  For  initial 
tenderness,  the  treatment  F-value  for  CS  was  significant  and 
larger  than  comparable  F-values  for  the  LS  and  ME  methods. 
The  LS  F-value  approached  statistical  significance 
(P=0.056).  For  ME,  the  treatment  F-value  was  very  low  and 
statistically  nonsignificant.  Thus,  for  initial  tenderness, 
the  CS  technique  was  most  sensitive  to  treatment 
differences,  followed  by  the  LS  technique.  When  panelists 
used  the  ME  method,  differences  in  initial  tenderness 
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Table  8.  Treatment  F-values  for  quality  attributes  from  the 
analyses.of  variance  of  category  scale,  line  scale 
and  magnitude  estimation  data. 


Attribute 

Treatment  F- 

values 

Category 

Scaling 

Line 

Scaling 

Magn i tude 
Estimation 

Initial  tenderness 

6.63* 

4.97 

1  .22 

Overall  tenderness 

7.33* 

5.11 

7.30* 

Connective  tissue  amount 

4.33 

1.21 

3.92 

Overall  juiciness 

13.55** 

3.03 

12.26** 

Flavor  intensity 

0.00 

0.10 

0.82 

*,  **  Significant  at  P<0.05  and  P<0.01,  respectively. 
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between  the  DC  and  HB  treatments  were  poorly  discriminated. 

Difficulties  with  the  ME  procedure  may  have  contributed 
to  panelists'  inability  to  discriminate  differences  in 
initial  tenderness  between  the  two  postmortem  treatments. 
Because  of  the  effort  and  time  required  for  sample 
mastication,  panelists  noted  difficulty  in  remembering  their 
impressions  of  one  sample  relative  to  another.  However, 
Moskowitz  (1983)  observed  that  panelists  develop  their  own 
internal  frame  of  reference  for  assigning  scores  to  each 
sample  and  thus  may  not  need  to  remember  attributes  in 
successive  samples. 

The  F-values  for  overall  tenderness  for  CS  and  ME  were 
significant  (P<0.05)  and  larger  than  the  F-value  from  the  LS 
analysis  of  variance  (Table  8).  However,  the  LS  F-value  was 
close  to  statistical  significance  (P=0.054).  Thus,  for 
overall  tenderness,  the  CS  and  ME  procedures  were  equally 
sensitive  to  treatment  differences  and  superior  to  the  LS 
technique . 

Initial  tenderness  F-values  for  the  CS  and  LS 
techniques,  respectively,  were  similar  to  those  obtained  for 
overall  tenderness.  However,  the  ME  treatment  F-value  for 
overall  tenderness  was  larger  than  the  corresponding  F-value 
obtained  for  initial  tenderness,  indicating  that  panelists 
were  better  able  to  detect  differences  in  overall  tenderness 
than  in  initial  tenderness  between  the  two  postmortem 
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Although  none  of  the  treatment  F-values  for  connective 
tissue  amount  (Table  8)  was  statistically  significant,  the 
F  values  for  the  CS  and  ME  techniques  were  slightly  larger 
than  that  determined  for  the  LS  method. 

For  overall  juiciness,  treatment  F-values  for  CS  and  ME 
were  similar  and  highly  significant  (P<0.01)  (Table  8). 
Analysis  of  juiciness  data  from  the  LS  technique  for  the  HB 
and  DC  treatments  resulted  in  a  low  nonsignificant  F-value. 
Thus,  the  CS  and  ME  evaluation  procedures  were  more 
sensitive  than  the  LS  me thod  for  the  detection  of  treatment 
differences  related  to  meat  juiciness. 

Flavor  intensity,  a  trait  not  expected  to  be  affected 
by  any  of  the  postmortem  treatments,  had  very  low  treatment 
F-values  for  each  sensory  evaluation  technique  (Table  8). 

In  general,  the  data  (Table  8)  indicate  that  the 
evaluation  techniques  differed  in  their  sensitivity  to 
treatment  differences.  The  CS  technique  was  most  sensitive 
to  treatment  differences  as  indicated  by  the  three 
significant  F-values  obtained  for  initial  tenderness, 
overall  tenderness  and  juiciness.  For  LS ,  F-values  for  each 
of  the  five  attributes  were  not  statistically  significant. 

For  ME,  treatment  F-values  for  overall  tenderness  and 
juiciness  were  significant  and  similar  to  those  obtained  for 
CS;  however,  the  F-value  for  initial  tenderness  was  not 
statistically  significant. 
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In  the  present  study,  the  CS  technique  was  considerably 
more  sensitive  to  differences  in  meat  quality  than  the  LS 
technique.  The  panelists'  inability  to  distinguish 
treatment  differences  by  the  LS  procedure  was  unexpected. 
Stone  et  al.  (1980)  have  reported  success  with  line  scales 
for  "Quantitative  Descriptive  Analysis"  (QDA ) .  However,  few 
studies  have  compared  the  sensitivity  of  the  LS  technique  to 
other  evaluation  methods  such  as  CS. 

Symonds  (1924)  observed  that  although  graphic  (line) 
scaling  allows  a  panelist  to  discriminate  as  finely  as 
desired,  this  scaling  technique  does  not  force  such  a 
judgement.  However,  for  assessments  of  apple  quality, 
judges  distinguished  a  greater  contrast  between  the  two 
apple  varieties  using  LS  than  CS  (Baten,  1946).  In 
addition,  statistical  analyses  of  the  LS  data  resulted  in 
larger  t-values  than  those  for  the  CS  data  (Baten,  1946). 
Variation  in  LS  procedures  between  the  study  of  Baten  (1946) 
and  the  present  study  may  have  contributed  to  the 
differences  in  the  results  obtained  in  the  two  studies. 

Baten  (1946)  allowed  and  encouraged  a  direct  comparison  of 
LS  ratings  between  samples  by  having  panelists  score  the 
apple  quality  of  two  samples  on  adjacent  line  scales. 

However,  in  the  present  study,  a  separate  line  scale 
scorecard  was  used  for  each  sample  under  investigation. 

The  sensitivity  of  a  nine-point  category  scale  and  of  a 
modified  line  scale  (nine  divisions  were  marked)  to 
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differences  in  meat  quality  were  compared  by  Raf f ensperger 
et  al.  (1956).  For  both  CS  and  LS ,  analyses  of  variance 
showed  similar  significant  (P<0.05)  treatment  effects 
(grade)  for  the  three  beef  cuts  evaluated. 

In  the  present  study,  the  ME  technique  was  as  sensitive 
as  CS  to  most  treatment  differences.  Comparisons  of  CS  and 
ME  procedures  have  frequently  been  made  by  untrained  panels 
for  hedonic  eval ua t ions  of  food  pr oduc t s  ( Mos  kow i t z  and 
Sidel,  1971;  McDaniel  and  Sawyer,  1981a;  Moskowitz,  1982). 
However,  few  studies  using  trained  panelists  have  compared 
CS  and  ME  for  the  descriptive  assessment  of  food  attributes. 
McDaniel  and  Sawyer  (1981a)  used  nine-point  CS  and  modulus 
free  ME  to  score  the  intensity  of  19  descriptive  profile 
terms  of  whiskey  sour  formulations.  The  ME  tehnique  yielded 
a  similar  number  of  significant  differences  to  CS  (13 
significant  differences  noted  for  ME;  14  for  CS)  (McDaniel 
and  Sawyer ,  1981a). 

Results  (Table  8)  indicate  that  the  ME  technique  was 
superior  to  the  LS  technique  in  sensitivity  to  differences 
between  the  two  postmortem  meat  treatments.  However,  in  a 
recent  study,  Giovanni  and  Pangborn  (1983)  noted  that  the  LS 
procedure  was  more  sensitive  than  the  ME  procedure  to 
differences  in  intensity  among  six  levels  of  fat  in  milk  (0, 
1,  2,  4,  8,  16  %)  and  seven  concentrations  of  sucrose  in 

lemonade  (4,  6,  8,  10,  14,  20,  30  %).  For  LS  and  ME  data, 
statistically  significant  differences  ( P<0 .001)  were  found 
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among  the  levels  of  fat  in  milk  and  among  the  sucrose 
concentrations  in  lemonade.  However,  for  the  LS  technique, 
treatment  F-values  for  fat  level  and  sucrose  concentration 
(230.9  and  633.1,  respectively)  were  larger  than  the 
corresponding  F-values  for  ME  (152.1  and  282.2, 
respectively)  (Giovanni  and  Pangborn,  1983). 

Replication  F-Values 

Replication  F-values  (Table  9)  for  each  of  the  three 
evaluation  techniques  were  similar  and  nonsignificant  for 
most  of  the  quality  attributes  under  investigation. 

However,  the  CS  replication  F-value  for  overall  juiciness 
was  significant  (P<0.01).  Significant  (P<0.05)  replication 
F-values  for  overall  juiciness  and  flavor  intensity  were 
also  determined  for  ME  data.  For  all  attributes, 
replication  F-values  for  LS  were  lower  than  corresponding 
F-values  for  CS  and  ME. 

The  magnitude  of  the  replication  F-value  reflects  the 
variation  in  scoring  among  the  nine  replications  evaluated. 
For  CS  and  ME,  there  was  a  greater  tendency  for  the  panel  to 
assign  high  scores  at  one  session  and  low  scores  at  another 
than  for  the  LS  technique.  However,  replication  variation 
only  reached  statistical  significance  once  for  CS  and  twice 
for  ME. 

In  a  comparative  evaluation  of  whiskey  sours  by  CS  and 
ME,  McDaniel  and  Sawyer  (1981a)  observed  that  replication 
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Table  9.  F  values  for  selected  sources  of  variation  from  the 
analyses.of  variance  of  category  scale,  line  scale 
and  magnitude  estimation  data. 


Source  of  Variation 

Attribute 

Replication  Panelist  Treatment  by 

Panelist 

Initial  tenderness 


Category  scaling 

1  .35 

4.85*** 

1  .22 

Line  scaling 

1.29 

1 1 . 58  *  *  * 

1  .34 

Magnitude  estimation 

1.51 

15.44*** 

0.43 

Overall  tenderness 

Category  scaling 

1  .58 

4.51*** 

0.82 

Line  scaling 

1  .44 

10.21*** 

0.65 

Magnitude  estimation 

2.11 

36.07*** 

1  .38 

Connective  tissue  amount 

Category  scaling 

2.68 

5.55*** 

0.59 

Line  scaling 

0.35 

1 1 . 40*** 

0.74 

Magnitude  estimation 

2.27 

19.70*** 

1.29 

Overall  juiciness 

Category  scaling 

8 . 10** 

4.22*** 

0.69 

Line  scaling 

1  .33 

7.99*** 

0.99 

Magnitude  estimation 

3.74* 

24.20*** 

2.14* 

Flavor  intensity 

Category  scaling 

1.68 

8.38*** 

1.19 

Line  scaling 

0.88 

9.81*** 

0.77 

Magnitude  estimation 

4.31* 

25 .27*** 

1  .29 

*,  **,  ***  Significant  at  P<0.05  and  P<0.01,  and  P<0.001, 
respectively . 
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F  values  for  CS  were  significant  14  times;  however,  for  ME, 
replications  were  not  significant  in  any  of  the  114 
comparisons  made. 

Panelist  F-Values 

Panelist  F-values  (Table  9)  were  significant  ( p< 0.001 ) 
for  each  quality  attribute  assessed  with  each  of  the  three 
evaluation  methods.  For  each  attribute,  the  CS  technique 
had  the  lowest  panelist  variation;  LS  was  intermediate  and 
ME  had  the  highest  variation.  The  magnitude  of  the  panelist 
F  values  for  each  of  the  evaluation  procedures  probably 
reflects  the  scale  range  used  for  each  technique.  For  CS, 
panelists  worked  within  a  confined  scale  range  of  1  to  8 
units;  for  LS,  the  scale  range  was  0  to  15  units;  and  for 
ME,  panelists  used  any  numbers  they  wanted  to  score  the 
samples . 

Significant  panel  variability  in  sensory  tests  is 
considered  common  and  to  be  expected  (McDaniel,  1974). 

Cross  et  al.  (1978)  determined  significant  panelist  F-values 
for  tenderness,  juiciness  and  connective  tissue  (eight-point 
category  scales)  for  each  of  four  trained  descriptive 
attribute  panels  under  investigation. 

In  an  evaluation  of  whiskey  sour  formulations,  McDaniel 
and  Sawyer  (1981a)  used  trained  panelists  to  compare 
nine-point  CS  and  modulus  free  ME.  One  significant  panelist 
F-value  was  observed  for  ME;  and  83  for  CS.  However,  in 
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their  study  (McDaniel  and  Sawyer,  1981a),  the  ME  data  had 
been  normalized  prior  to  statistical  analysis.  In  the 
present  study,  panelist  variability  (due  to  scale  range)  was 
removed  in  the  analysis  of  variance  procedure  as  the 
panelist  source  of  variation. 

Treatment  by  Panelist  F-Values 

F-values  for  the  treatment  by  panelist  interaction  are 
shown  in  Table  9.  Except  for  the  ME  assessment  of  overall 
juiciness,  all  treatment  by  panelist  F-values  were  not 
significant  indicating  that  individual  panelists  agreed  with 
one  another  when  rating  the  treatments  by  CS,  LS  or  ME. 
Examination  of  the  data  for  overall  juciness  assessments  by 
ME  revealed  that  two  panelists  had  scored  the  DC  and  HB 
treatments  opposite  to  the  scores  assigned  by  the  other  16 
panelists.  Since  these  panelists  had  performed 
satisfactorily  in  evaluating  all  quality  attributes  during 
training  and  in  judging  the  other  attributes  during  the 
study,  no  reason  for  their  method  of  assigning  juiciness 
scores  is  readily  apparent. 

In  general,  the  data  (Tables  8  and  9)  show  that  the 
magnitude  and  significance  of  F-values,  especially  treatment 
F-values,  were  useful  criteria  for  a  comparison  of  the  three 
evaluation  techniques.  Several  workers  (Moskowitz,  1982; 
Giovanni  and  Pangborn,  1983)  have  also  used  treatment  F-values 
as  criteria  for  comparing  sensory  evaluation  techniques. 
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Relationships  Between  Evaluation  Techniques 

In  addition  to  subjecting  the  data  to  analyses  of 
variance ,  correlation  and  regression  analyses  were  performed 
to  assess  relationships  between  each  of  the  evaluation 
techniques.  Data  for  these  analyses  were  averaged  over  all 
panelist  responses  for  each  of  the  18  roasts  (nine  per 
treatment ) . 

Pearson  correlation  coefficients  and  coefficients  of 
determination  between  each  of  the  three  evaluation 
procedures  are  presented  in  Table  10.  For  initial 
tenderness,  correlation  coefficients  for  CS,  LS  and  ME  data 
were  highly  significant  ( P< 0.001 )  and  indicated  strong 
linear  relationships  between  the  scores  assigned  by  judges 
using  each  of  the  evaluation  methods  (Table  10).  Shindell 
(1964)  suggested  that  correlation  coefficients  were  low,  if 
r<0.39 ;  moderate,  if  r  is  between  0.40  and  0.79;  and  high, 
if  r>0.80,  irrespective  of  the  sign.  In  the  present  study, 
the  assignment  of  higher  numbers  indicated  greater 
tenderness  for  both  CS  and  LS .  High  numbers  signified 
toughness  with  ME.  Thus,  the  positive  and  negative 
relationships  (Table  10)  are  as  expected.  Coefficients  of 
determination  show  that  74  to  86  %  of  the  variation  in  the 
scoring  of  initial  tenderness  may  be  explained  by  the 
relationships  between  each  of  the  three  sensory  evaluation 
procedures . 
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Table  10.  Pearson  correlation  coefficients  (r)  and 

coefficients  of  determination  (r2)  between  panel 
_ assessments  from  each  evaluation  technique1. 


Attribute 

r 

r  2 

Initial  tenderness 

CS  vs  LS 

0.93*** 

0 .86 

CS  vs  ME 

-0.86*** 

0.74 

LS  vs  ME 

-0.91*** 

0.82 

Overall  tenderness 

CS  vs  LS 

0.93*** 

0.86 

CS  vs  ME 

-0.92*** 

0.84 

LS  vs  ME 

-0.92*** 

0.84 

Connective  tissue  amount 

CS  vs  LS 

0.54* 

0.29 

CS  vs  ME 

-0.34 

0.12 

LS  vs  ME 

-0.53* 

0.28 

Overall  juiciness 

CS  vs  LS 

0 . 44* 

0.20 

CS  vs  ME 

0 . 74*** 

0.54 

LS  vs  ME 

0.65** 

0.42 

Flavor  intensity 

CS  vs  LS 

0.37 

0.14 

CS  vs  ME 

0 . 59** 

0.35 

LS  vs  ME 

0.36 

0.13 

'Category  scaling  (CS),  line  scaling  ( LS )  and  magnitude 
estimation  (ME) . 

*,  **,  ***  Significant  at  P<0.05,  P<0.01  and  P<0.001, 
respectively . 
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Correlation  coefficients  for  overall  tenderness  between 
each  of  the  evaluation  techniques  were  similar  in  magnitude 
to  those  determined  for  initial  tenderness  (Table  10).  All 
correlations  were  highly  significant  ( P<0 . 00 1 )  and  accounted 
for  84  to  86  %  of  the  variation  in  these  data.  The 
coefficient  of  determination  between  CS  and  ME  data  for 
overall  tenderness  was  10  %  higher  than  that  determined  for 
initial  tenderness.  Figures  9  to  11  illustrate  the  linear 
nature  of  the  relationships  between  overall  tenderness 
assessments  for  each  of  the  three  evaluation  procedures. 

In  a  study  conducted  by  Cross  et  al.  (1978),  four 
panels,  given  similar  training,  evaluated  similar  L  steaks 
using  eight-point  category  scales.  Tenderness  assessments 
were  highly  correlated  across  panels  (r-values  ranged  from 
0.88  to  0.94)  (Cross  et  al.,  1978). 

Recently,  Dransfield  et  al.  (1982)  compared  the  eating 
quality  of  beef  assessed  at  each  of  five  European  research 
institutes.  For  a  common  eight-point  tenderness  scale, 
correlation  coefficients  between  the  trained  panel  scores 
from  each  of  the  five  institutes  ranged  from  0.69  to  0.94. 
When  panel  assessments  using  each  institute's  usual 
tenderness  scale  (six  to  11  categories  in  length)  were 
correlated,  r-values  of  0.73  to  0.94  were  determined 
(Dransfield  et  al.,  1982). 

For  connective  tissue  amount,  correlations  between  the 
three  evaluation  methods  were  moderate  to  low  and  accounted 
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Figure  10.  Relationship  between  panel  overall  tenderness  assessments  by  category  scaling 
and  magnitude  estimation  (N=18).  ^  y  g 
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for  only  12  to  29  %  of  the  variation  in  the  data  (Table  10). 
In  the  present  study,  sample  heterogeneity  may  have 
contributed  to  these  low  correlations  for  connective  tissue 
amount  between  the  evaluation  procedures.  In  addition,  for 
each  evaluation  technique,  mean  connective  tissue  values  for 
the  18  roasts  were  from  a  smaller  stimulus  range  than  the 
corresponding  range  in  means  for  overall  tenderness  for  each 
of  the  roasts  (Appendix,  Table  24,  page  146).  Thus,  for  the 
category  scale,  the  range  in  means  for  the  18  roasts  was  1.8 
units  for  connective  tissue  amount  and  2.8  units  for  overall 
tenderness.  Several  researchers  (Szczesniak,  1968;  Cross  et 
al.,  1978)  have  noted  that  correlation  of  values  from  a 
small  stimulus  range  may  result  in  low  correlation 
coef  f ic ient s . 

Cross  et  al.  (1978)  obtained  r-values  of  0.79  to  0.86 
for  assessments  of  connective  tissue  amount  between  each  of 
four  trained  panels.  Similar  L  steaks  from  11  maturity/ 
marbling  groups  were  evaluated  by  each  panel.  In  the 
present  study,  the  SM  steaks  were  obtained  from  young 
animals  (15  mo)  of  similar  grade  (Al)  and  thus  might  not  be 
expected  to  contain  wide  ranges  of  connective  tissue. 

Correlation  coefficients  for  overall  juiciness  between 
each  of  the  sensory  procedures  were  moderate  (Table  10). 
Since  all  steaks  were  cooked  to  the  same  final  internal 
temperature,  the  range  of  juiciness  scores  for  each 
evaluation  technique  was  small  (Appendix,  Table  24,  page 
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146).  For  steaks  cooked  to  one  degree  of  doneness,  several 
workers  (Cross  et  al.,  1978;  Dransfield  et  al.,  1982)  have 
also  reported  that  correlations  of  juiciness  between 
panels  were  moderate  to  low. 

Flavor  intensity  aata  yielded  low  to  moderately  low 
correlation  coefficients  between  each  of  the  three 
evaluation  techniques  (Table  10).  Thirteen  to  35  %  of  the 
variation  in  the  data  could  be  explained  by  the 
relationships  between  data  from  each  procedure.  A  small 
stimulus  range  for  flavor  intensity  was  noted  since  steaks 
from  the  two  postmortem  treatments  received  similar  flavor 
scores  (Appendix,  Table  24,  page  146). 

Dransfield  et  al.  (1982)  correlated  flavor  intensity 
scores  between  each  of  three  European  research  institutes 
and  obtained  r-values  (0.41  to  0.51)  similar  to  those  of  the 
present  study  (Table  10). 

In  general,  correlation  coefficients  betweeen  each  of 
the  evaluation  techniques  (Table  10)  for  initial  and  overall 
tenderness  were  highly  significant  and  indicated  strong 
linear  relationships  between  scores  assigned  by  judges  using 
CS,  LS  and  ME.  For  the  remaining  attributes,  the 
correlation  between  each  of  the  three  methods,  while 
sometimes  significant,  were  moderate  to  low.  In  the  present 
study,  sample  heterogeneity,  especially  for  connective 
tissue  amount,  and  a  narrow  stimulus  range,  particularly  for 
juiciness  and  flavor  intensity,  may  have  contributed  to  the 
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low  correlations  obtained  (Szczesniak,  1968;  Cross  et  al., 
1978)  . 

Table  11  summarizes  coefficients  of  determination  for 
tenderness  assessments  from  each  of  the  three  evaluation 
techniques  fitted  to  five  mathematical  functions.  The 
coefficients  of  determination  obtained  between  each  of  the 
evaluation  techniques  were  similar.  None  of  the 

mathematical  functions  provided  a  consistently  superior  fit 
of  the  data. 

For  the  parabolic  functions  between  CS  and  ME 
tenderness  data,  the  quadratic  (X2)  term  did  not  make  a 
significant  contribution  to  the  function.  Thus,  for  initial 
and  overall  tenderness,  the  relationships  between  CS  and  ME 
data  are  not  curvilinear.  The  linear  nature  of  the 
relationship  betweem  CS  and  ME  data  for  overall  tenderness 
was  shown  in  Figure  10,  page  90.  In  contrast,  Cardello  et 
al.  (1982b)  rescaled  six  standard  texture  scales  using  ME 
and  found  that  the  CS  data  was  concave  downward  relative  to 
the  ME  data.  This  indicated  that  at  higher  intensities  of 
each  attribute  evaluated,  CS  was  less  sensitive  than  ME  to 
differences  in  these  attributes  (Cardello  et  al.,  1982b). 

In  the  present  study,  however,  data  for  the  evaluation  of 
beef  samples  using  CS  did  not  cover  the  entire  stimulus 
range  of  the  scale  and  thus  may  have  influenced  the  shape  of 
the  resulting  functions. 
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Relationships  Between  Sensory  and 
Measures  of  Tenderness 


Instrumental 


Correlation  coefficients  and  coefficients  of 
determination  between  panel  tenderness  assessments  from  each 
evaluation  procedure  and  Warner  Bratzler  shear  data  are 
presented  in  Table  12.  For  each  evaluation  technique, 
correlations  of  initial  and  overall  tenderness  with  Warner 
Bratzler  shear  values  yielded  similar  significant  r-values 
(Table  12).  The  positive  and  negative  relationships  are  as 
would  be  expected  since  for  CS  and  LS  data,  high  numbers 
indicate  tenderness;  and  for  ME  and  shear  force  data,  high 
numbers  signify  toughness. 

For  CS  and  LS  data,  r-values  between  initial  tenderness 
and  shear  were  similar  to  corresponding  r-values  between 
overall  tenderness  and  shear.  However,  the  data  for  the  ME 
technique  (Table  12)  suggest  that  the  relationship  between 
overall  tenderness  and  shear  was  slightly  better  than  that 
for  initial  tenderness  and  shear. 

Even  though  all  correlations  of  subjective  tenderness 
data  with  shear  force  measurements  were  statistically 
significant,  the  correlations  only  accounted  for  40  to  51  % 
of  the  variation  in  the  data.  Sample  heterogeneity  may  have 
contributed  to  the  unexplained  data  variation  (Kapsalis  and 
Moskowitz,  1977).  In  the  present  study,  the  high 
unexplained  variation  in  the  data  may  be  a  reflection  of  the 
lack  of  sensitivity  of  the  Warner  Bratzler  shear  to 
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Table  12.  Pear son ^ cor relat ion  coefficients  (r)  and 

coefficients  of  determination  (r2)  between  panel 
tenderness  assessments  from  each  evaluation 
technique1  and  Warner  Bratzler  shear  data. 


Attribute 

r 

r  2 

Initial  tenderness 

CS 

“0.71*** 

0.51 

LS 

- 0 . 68  *  *  * 

0.47 

ME 

0.63** 

0.40 

Overall  tenderness 

CS 

“0 . 68  *  *  * 

0.46 

LS 

-0.69*** 

0.48 

ME 

0.70*** 

0.49 

’Category  scaling  (CS), 

line 

scaling  (LS) 

and  magnitude 

estimation  (ME ) . 

**,  ***  Significant  at 

P<0.01 

and  P<0 .001, 

respectively . 
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structural  components  that  influence  taste  panel 

assessments.  Gullett  et  al.  (1983)  reported  that  shear 

values  may  not  be  related  to  panelists’  perceptions  of  meat 
tenderness . 


Correlations  between  Warner  Bratzler  shear  values  and 
sensory  tenderness  assessments  vary  considerably  in  the 
literature  (Szczesniak  and  Torgeson,  1965).  In  addition, 
correlation  coefficients  are  difficult  to  compare  between 
studies.  T he  magnitude  of  the  correlation  coefficient 
obtained  in  any  study  is  highly  dependent  upon  the  range  of 
values  covered,  the  number  of  samples  involved  and  the 
amount  of  variation  within  the  block  of  samples  (Szczesniak, 
1968) . 

For  the  SM  muscle,  correlation  analysis  of  tenderness 
assessments  by  the  CS  technique  with  shear  values  have 
resulted  in  r-values  of  -0.81  (Breidenstein  et  al.,  1968)  to 
-0.53  (Smith  et  al.,  1978)  or  lower  (McCurdy  et  al.,  1981). 
However,  studies  reporting  correlations  of  panelist 
assessments  of  tenderness  using  the  LS  method  with  Warner 
Bratzler  shear  values  are  lacking.  In  addition,  no  reports 
have  correlated  ME  tenderness  values  with  Warner  Bratzler 
shear  data.  However,  Segars  et  al.  (1975)  reported 
correlation  coefficients  ranging  from  0.92  to  0.98  for 
texture  evaluations  (difficulty  of  cutting,  chewiness  and 
residue)  by  the  ME  procedure  and  an  instrumental  measure  of 
tenderness  (punch  and  die  test  cell).  Since  only  six 
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muscles  representing  extremes  in  tenderness  were  used,  the 
results  of  this  study  must  be  interpreted  with  caution. 
Recently,  Segars  et  al.  (1981)  obtained  a  correlation  of 
0.76  between  magnitude  estimates  of  hardness  in  irradiated 
restructured  meat  products  and  a  punch  and  die  shear  cell. 

Correlations  of  sensory  scores  and  mechanical 
measurements  have  been  used  as  an  indicator  of  panel 
accuracy  (Hovenden  et  al.,  1979).  Results  of  the  present 
research  (Table  12)  show  that  each  of  the  evaluation 
techniques  yeilded  similar  significant  r-values. 

Summarized  in  Table  13  are  coefficients  of 
determination  between  panel  tenderness  assessments  from  each 
evaluation  technique  and  Warner  Bratzler  shear  data  fitted 
to  five  mathematical  functions.  Similar  coefficients  of 
determination  between  sensory  assessments  of  tenderness  and 
shear  data  were  determined  for  each  of  the  five  mathematical 
functions.  Thus,  the  CS ,  LS  and  ME  data  can  be  explained 
equally  well  by  each  of  these  functions.  Furthermore, 
linear,  logarithmic,  hyperbolic  and  parabolic  functions 
described  ME  data  as  well  as  the  more  traditionally  used 
power  function.  This  finding  agrees  with  a  recent  report  by 
Giovanni  and  Pangborn  (1983). 

The  power  functions  between  panel  tenderness 
assessments  using  ME  and  Warner  Bratzler  shear  data  are 
illustrated  in  Figure  12.  The  exponent  or  slope  of  the 
power  function  is  thought  to  be  an  index  of  perceptual 


* 


. 

. 

. 

. 

' 


' 

'ImI 


100 


x 

r 

0 


0 

3 

c r 

c 

x 

o 

0) 


CM 


U  X 

u 

—  + 

r^ 

o 

ID 

O) 

0) 

o  x 

X  X 

in 

rr 

0  + 

X  0 

0  II 

b 

O 

o 

o 

o 

o 

CL  > 


C 

o 


(0 

3 

(0 

> 

0) 

x 

u 

(0 

0 


o  ~ 

—  X 
■ - s 


o  ■>- 

o 

!"• 

in 

X  w 
l-  X 

in 

CO 

0  + 
a  0 
>  n 

o 

o 

o 

O 

o 

c 

X  > 


E 

O 

X 


C 

o 


V) 

F 

c 

0 

E 

0 

m 

0  • 
0  in 
0  c 
®  o 

in  f 
0  O 
0  C 
C  3 
X  F 
0 

X  - 

c  0 

0  u 


—  re 

0  E 
c  0 
0  x 

Q.  F 
0 

C  E 

0 

0  0 

3  > 

F  -r- 

0  F 
X 

0 


DC  X 

W  0) 


c  — 

O  f 

F  0 
0  F 
C  0 

-  X 

E 

X  X 
0  0 
f  a3 
0  x 

X  0 

F  X 

O  0 

0  N 
F  F 
C  0 
0  X 

—  CQ 
O 

<-  X 
F  0 
F  C 
0  X 
O  0 

a  s 


CO 

▼— 

0 

x 

ro 


•F 

O 

c 

3 


0 

o 


0 

E 

0 

X 

F 

0 


U 

■<-  X 
£  C 


X  — 

T- 

ID 

05 

CD 

1" 

00 

F  X 

in 

"tf 

CO 

—  + 

X  0 

O 

o 

o 

o 

O 

O 

0  II 

D)> 

o 


X 

CO 

r- 

r- 

00 

r- 

0  x 

in 

co 

'S' 

3  0 

O  II 

O 

O 

O 

O 

o 

O 

CL  > 


X  X 


0  X 

T- 

t" 

o 

CD 

CO 

05 

0  + 

in 

^r 

"sr 

'J 

C  0 

--  II 

o 

o 

O 

o 

o 

o 

_l  > 


0 

F 

3 

X 

X 


< 


0 

0 

0 

c 

X 

0 

X 

C 

0 


0 

0 

0 

c 

X 

0 

X 

C 

0 


V) 

(J 


I/) 


1 —  I/O  I/O  LU 

-  O  —I  2 

0 
x 
0 
> 

O 


LU 


.  ► 

c 

■*- 

0 

o 

•r- 

•r— 

o  o 

F 

0 

o  o 

E 

V 

V 

-f- 

a 

X 

F 

0 

F 

F 

0 

0 

0 

0 

F 

F 

X 

c 

c 

3 

0 

0 

F 

0 

a 

i“ 

-r- 

t- 

C 

F 

F 

O) 

~r~ 

~r- 

0 

c 

c 

E 

O) 

O) 

-i— 

X 

0 

0 

c 

0 

0 

0 

X 

X 

y — - 

0 

0 

1/5 

3 

3 

X 

w 

CD 

CO 

CO 

05 

C 

o 

o 

f 

r— 

VI 

VI 

0 

O 

c 

c 

0 

0 

0 

-r- 

*r— 

0 

F 

F 

c 

0 

0 

f- 

c 

c 

f— 

•1— 

•r- 

E 

E 

► 

X 

X 

/"V 

a> 

0 

C/5 

F 

F 

o 

0 

0 

w 

X 

X 

05 

F 

F 

C 

0 

O 

0 

0 

0 

F 

F 

o 

c 

c 

0 

0 

0 

•r— 

•r— 

> 

O 

a 

X 

*r- 

0 

F 

F 

05 

F 

F 

0 

0 

0 

F 

0 

0 

0 

U 

CJ 

CJ 

«M 


' 


Overall  Tenderness  LN  geom.  mean  ME  Initial  Tenderness  LN  geom.  mean  ME 


101 


Warner  Bratzler  Shear,  LN  Kg 

Figure  12.  Power  functions  between  panel  (a)  initial 

tenderness  and  (b)  overall  tenderness  assessments 
by  magnitude  estimation  (ME)  and  Warner  Bratzler 
shear  (N= 1 8 ) . 


- 


■ 


102 


sensitivity  .  For  initial  tenderness  and  overall  tenderness, 
the  power  functions  relating  ME  and  shear  data  yielded 
exponents  of  1.13  and  1.05,  respectively.  These  exponents 
indicate  that  panel  tenderness  assessments  using  ME  and  the 
Warner  Bratzler  shear  determinations  on  the  Ottawa  Texture 
Mea sur i ng  System  were  similar  in  sensitivity  to  tenderness 
differences.  However,  because  of  data  variability  shown  in 
Figure  12,  the  standard  errors  for  initial  tenderness  (0.37) 
and  overall  tenderness  (0.28)  are  large.  Thus  the  exponents 
determined  in  the  present  study  may  not  be  reliable 
indicators  of  sensory  response.  Segars  et  al.  (1975) 
reported  that  exponents  relating  textural  properties  of  meat 
to  an  instrumental  measure  (punch  and  die  shear  device) 
ranged  from  1.8  to  2.6.  However,  comparison  of  exponents 
between  studies  is  difficult  because  exponents  may  be 
dependent  on  stimulus  range  and  response  range  (Birnbaum, 
1982)  . 

Scale  Preference 

Mean  values  for  panelist  rankings  of  selected  criteria 
for  the  three  evaluation  techniques  are  presented  in  Table 
14.  In  addition,  Table  15  provides  data  regarding  panelist 
response  by  rank  for  each  evaluation  procedure.  Comments 
provided  by  panelists  using  the  open-ended  questionaire 
(Appendix,  Figure  17,  page  136)  have  been  incorporated  into 
the  discussion  of  each  criterion. 


. 
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Table  14.  Mean  values1  for  panelist  ranking  of  selected 

criteria  for  category  scaling,  line  scaling  and 
magnitude  estimation. 


Evaluation  Technique 

Criteria  2 

Category 

Scaling 

Line 

Scaling 

Magnitude 

Est imat ion 

Ease  of  learning 

1.4b 

1  ,6b 

3 . 0a 

Appl icabi 1 i ty 

1.8 

1  .8 

2.2 

Scale  accuracy 

1  .3c 

2.  1b 

2 . 6a 

Effort  for 

Sample  evaluation 

2 . 4a 

2.4a 

1  .2b 

Scale  preference 

1.4c 

2.  1b 

2.7a 

’Values  are  the  mean  of  ranks  assigned  by  the  18  panelists. 
For  applicability,  values  are  the  mean  of  ranks  assigned 
by  17  panelists. 

2Ranking  based  on: 

1=easiest,  most  applicable,  most  accurate,  most  effort  and 
most  preferred; 

3=most  difficult,  least  applicable,  least  accurate,  least 
effort  and  least  preferred. 

a,b,c  Means  within  the  same  row  sharing  a  common  superscript 
are  not  significantly  different  at  P<0.05. 
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Table  15.  Frequency  of  panelist  ranks1  assigned  to  each  of 
the  evaluation  techniques  for  selected  criteria. 


Frequency  of  Ranks1 


Criteria 

Rank 

Category  Line  Magnitude 

Scaling  Scaling  Estimation 

Ease 

1 

(easiest ) 

56 

44 

0 

of 

2 

44 

56 

0 

learning 

3 

(most  difficult) 

0 

0 

100 

Appl icabi 1 i ty 

1 

(most ) 

35 

4  1 

24 

for  evaluation 

2 

47 

35 

18 

of  beef 

3 

( least ) 

18 

24 

58 

Scale 

1 

(most ) 

78 

22 

0 

accuracy 

2 

17 

44 

39 

3 

( least ) 

6 

33 

61 

Effort  needed 

1 

(most ) 

1  1 

0 

89 

to  evaluate 

2 

39 

56 

6 

each  sample 

3 

( least ) 

50 

44 

6 

Scale 

1 

(most ) 

72 

17 

1  1 

preference 

2 

17 

61 

22 

3 

(least ) 

1  1 

22 

67 

’Frequencies  of  ranks  are  expressed  as  a  percentage  of 
panelists.  For  most  criteria,  ranks  were  assigned  by 
18  panelists;  however,  for  applicability,  only  17 
panelists  provided  ranks. 
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The  CS  and  LS  techniques  were  ranked  easier  (P<0.001) 
to  learn  than  the  ME  technique.  Fifty-six  percent  (10)  of 
the  panelists  ranked  CS  as  easiest  to  learn;  the  remaining 
44  /0  (8)  ranked  the  LS  procedure  as  the  easiest  to  learn 
(Table  15).  However,  all  panelists  ranked  ME  as  the  most 
difficult  of  the  three  evaluation  techniques  to  learn.  One 
panelist  stated  that  the  principle  of  ME  was  easy  to 
understand  but  that  its  application  to  meat  was  difficult. 

Each  evaluation  method  received  similar  mean  ranks  for 
scale  applicability  (Table  14).  However,  panelist  responses 
(Tables  14  and  15)  suggest  that  the  CS  and  LS  procedures  may 
have  been  considered  slightly  more  applicable  for  the 
sensory  evaluation  of  beef  than  the  ME  technique. 

For  scale  accuracy,  mean  ranks  for  each  of  the  three 
sensory  evaluation  methods  differed  significantly  (P<0.001). 
The  CS  method  was  ranked  as  the  most  accurate;  LS  was  ranked 
intermediate  and  ME  was  ranked  as  the  least  accurate  sensory 
procedure . 

Panelists  commented  that  they  could  be  more  consistent 
in  scoring  the  five  descriptive  attributes  of  beef  with  CS 
than  with  either  LS  or  ME.  One  panelist  observed  that  it 
was  more  difficult  to  score  along  the  centre  portion  of  the 
line  scale  than  at  positions  near  each  end.  Many  panelists 
commented  that  the  magnitude  estimates  they  assigned  to  the 
quality  attributes  varied  considerably  from  day  to  day. 

While  this  day  to  day  variability  was  removed  in  the 
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statistical  procedures  employed  for  data  analysis,  concern 
about  the  variability  of  the  ME  responses  is  probably 
reflected  in  the  low  ME  rankings  assigned  by  judges  for 
scale  accuracy. 


Both  the  CS  and  LS  techniques  required  similar  and  less 
effort  ( P<0 . 001 )  for  sample  evaluation  than  ME  (Table  14). 
Many  panelists  commented  that  evaluation  with  ME  required 
intense  concentration  in  order  to  assess  quality  attributes 
of  one  sample  relative  to  another. 

Usually,  the  descriptive  sensory  assessment  of  beef 
involves  the  simultaneous  evaluation  of  several 
characteristics.  This  type  of  assessment  may  be  more 
difficult  with  ME  than  with  either  CS  or  LS,  and  may  result 
in  decreased  sensitivity  to  treatment  differences.  Segars 
et  al.  (1981)  modified  their  ME  procedure  for  irradiated 
beef,  ham  and  poultry  rolls  in  order  to  reduce  memory 
interference  and  to  make  ME  evaluations  easier  for  the 
panelists  (Cardello,  personal  communication).  Originally, 
panelists  were  instructed  to  evaluate  four  attributes  in 
each  sample  of  the  series  presented;  in  a  later  phase  of  the 
investigation,  panelists  evaluated  a  separate  series  of 
samples  for  each  of  three  attributes  (Segars  et  al.,  1981). 

Scale  preference  differed  significantly  (P<0.001)  among 
the  evaluation  techniques  (Table  14).  Seventy-two  percent 
of  the  panelists  preferred  CS  to  either  LS  or  ME  (Table  15). 
LS  was  ranked  intermediate  in  preference  and  ME  was  least 
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preferred. 

The  effect  of  scale  preference  on  panelist  motivation 
is  unknown.  If  a  panelist  is  not  comfortable  with  the  task, 
sensory  performance  may  be  affected.  However,  Baten  (1946) 
reported  that  even  though  panelists  preferred  the  CS 
technique  to  the  LS  technique,  their  discrimination  of 

differences  between  apple  varieties  was  greater  with  LS  than 
with  CS. 

In  general,  the  data  (Tables  14  and  15)  indicate  that 
the  CS  method  was  easy  to  learn,  accurate,  required  little 
effort  for  sample  evaluation  and  was  preferred  by  panelists 
for  the  descriptive  sensory  assessment  of  beef.  The  LS 
procedure  was  also  considered  easy  to  learn,  required  little 
effort  for  sample  evaluation  and  was  ranked  intermediate  in 
scale  accuracy  and  preference.  However,  panelists  found 
that  the  ME  technique  was  more  difficult  to  learn,  less 
accurate  and  required  more  effort  for  sample  evaluation  than 
the  other  techniques.  In  addition,  the  ME  procedure  was 
least  preferred  by  panelists  for  sensory  assessments  of 
cooked  meat  quality. 
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SUMMARY  AND  CONCLUSIONS 


Three  sensory  evaluation  techniques  employed  for  the 
descriptive  sensory  assessment  of  beef  were  compared:  ( 1 ) 
category  scaling  (CS)  which  is  most  commonly  used  by  meat 
scientists,  (2)  line  or  semi  structured  scaling  (LS)  which 
forms  the  basis  for  "Quantitative  Descriptive  Analysis"  and 
(3)  magnitude  estimation  (ME)  which  is  becoming  popular  for 
food  research  but  has  not  been  widely  utilized  with  meat. 

Eighteen  trained  panelists  evaluated  the  initial 
tenderness,  overall  tenderness,  connective  tissue  amount, 
overall  juiciness  and  flavor  intensity  of  semimembranosus 
(SM)  steaks  using  an  eight-point  category  scale,  a  15  cm 
line  scale  with  anchors  1.3  cm  from  each  end  and  modulus 
free  magnitude  estimation. 

Panelists  were  screened  and  then  intensively  trained  in 
the  use  of  each  evaluation  technique,  panel  procedures  and 
the  quality  attributes  under  investigation. 

Semimembranosus  roasts  used  for  the  study  were  from 
four  postmortem  meat  treatments  (delay  chilling  (DC),  delay 
chilling  with  electrical  stimulation  (DC-ES),  hot  boning 
(HB)  and  hot  boning  with  electrical  stimulation  (HB-ES)) 
selected  to  provide  differences  in  organoleptic  quality. 

A  total  of  108  steaks,  three  from  each  of  36  roasts, 
were  individually  roasted  at  176°  to  an  internal  temperature 
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of  65  .  To  minimize  the  effects  of  scale  carry  over, 
learning  and  motivational  factors,  panelists  utilized  each 
evaluation  technique  in  sequence  in  different  predetermined 
orders.  Each  judge  evaluated  one  of  three  adjacent  steaks 
from  each  of  nine  roasts  per  treatment  using  each  evaluation 
technique . 

In  addition  to  sensory  evaluation  of  the  samples, 
visual  evaluation  of  steak  doneness  and  objective 
measurements  of  tenderness  (Warner  Bratzler  shear  on  the 
Ottawa  Texture  Measuring  System),  juiciness  (percentage 
press  fluid)  and  color  (Hunter  Color  Difference  Meter)  were 
also  made. 

Few  treatment  differences  attributable  to  ES  were 
found.  Therefore,  the  comparison  of  the  three  sensory 
evaluation  techniques  focused  on  differences  obtained 
between  two  of  the  postmortem  treatments  (DC  and  HB). 
Criteria  used  to  compare  the  three  evaluation  techniques 
included  an  examination  of:  the  objective  and  sensory  data 
for  the  two  postmortem  treatments,  the  sensitivity  of  the 
evaluation  methods  to  differences  in  quality  attributes 
between  the  two  postmortem  treatments,  the  replication  and 
panelist  variation,  the  significance  of  the  treatment  by 
panelist  interaction,  the  relationships  between  the  three 
evaluation  procedures,  the  relationships  between  tenderness 
assessments  and  Warner  Brazier  shear  data,  and  scale 
preference . 
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Chemical  data,  raw  weight,  cooking  time,  final  internal 
temperature  and  cooking  losses  for  SM  steaks  showed  no 
differences  attributable  to  DC  and  HB.  However,  HB  steaks 
had  a  significantly  greater  percentage  drip  in  thaw  then 
comparable  DC  steaks.  In  addition,  HB  steaks  were  more  rare 
than  comparable  DC  steaks  as  determined  subjectively  (degree 
of  doneness)  and  objectively  (Hunter  3  values,  percentage 
press  fluid) . 

When  the  18  trained  panelists  employed  CS ,  HB  steaks 
were  rated  lower  (P<0.05)  in  initial  tenderness  and  overall 
tenderness  and  higher  ( P< 0.01)  in  overall  juiciness  than 
comparable  DC  steaks.  Using  LS ,  there  were  no  significant 
differences  in  the  quality  attributes  of  the  SM  steaks  due 
to  postmortem  treatment.  With  the  ME  technique,  the  HB 
steaks  were  less  tender  (P<0.05)  and  more  juicy  ( P< 0.01) 
than  comparable  DC  steaks. 

Although  there  were  differences  in  statistical 
significance,  the  direction  and  extent  of  the  difference 
between  mean  scores  assigned  to  each  of  the  descriptive 
attributes  in  the  DC  and  HB  steaks  were  consistent  among  the 
three  evaluation  techniques.  For  each  evaluation  method, 
differences  between  steak  treatment  means  tended  to  be  large 
for  initial  tenderness  and  overall  tenderness,  intermediate 
for  amount  of  connective  tissue  and  juicines,  and  slight  for 
flavor  intensity.  This  consistency  in  scoring  between  each 
of  the  three  evaluation  techniques  provides  an  internal 
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validation  for  the  results  of  the  present  study. 

Sensory  assessments  of  tenderness  tend  to  be  supported 
by  Warner  Bratzler  shear  data.  Although  the  difference  was 
not  statistically  significant,  the  average  shear  force  value 
for  HB  steaks  was  higher  than  that  of  comparable  DC  steaks. 
Since  steaks  from  the  DC  and  HB  treatments  attained  the  same 
average  internal  temperature  of  65.1°  (range,  64.75°  to 
65.50  ) ,  juiciness  was  not  expected  to  differ  between  the 
two  postmortem  treatments.  However,  the  significant 
differences  for  juiciness  observed  by  panelists  using  CS  and 
ME  are  supported  by  objective  measurements.  The  HB  steaks 
were  visually  more  rare  ( P<0 .01),  had  higher  ( P<0 .001) 

Hunter  a  (redness)  values  and  greater  (P<0.01)  percentage 
press  fluid  than  their  DC  counterparts. 

An  examination  of  the  distributions  of  panelist 
responses  for  overall  tenderness  for  the  two  postmortem  meat 
treatments  revealed  that  for  each  evaluation  technique,  HB 
steaks  tended  to  be  less  tender  than  comparable  DC  steaks. 
These  distributions  support  results  from  the  analysis  of 
variance  of  these  data  for  CS  and  ME  which  showed 
significant  differences  in  overall  tenderness  attributable 
to  postmortem  treatment. 

The  magnitude  and  significance  of  F-values  for  selected 
sources  of  variation  from  the  split-plot  analyses  of  CS ,  LS 
and  ME  data  were  examined.  Treatment  F-values  were  used  as 
a  measure  of  the  sensitivity  of  each  evaluation  technique  to 
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differences  in  palatability  due  to  postmortem  treatment. 

The  CS  procedure  was  most  sensitive  to  treatment  differences 
as  indicated  by  the  three  significant  F-values  obtained  for 
initial  tenderness,  overall  tenderness  and  juiciness.  The 
LS  technique  was  least  sensitive  in  detecting  differences  in 
steak  quality  attributes.  No  significant  treatment  F-values 
for  LS  were  determined.  Magnitude  estimation  was  as 
sensitive  as  CS  to  most  treatment  differences.  For  ME, 
treatment  F-values  for  overall  tenderness  and  juiciness  were 
significant  and  similar  to  those  obtained  by  CS;  however, 
the  F -value  for  initial  tenderness  was  not  statistically 
signi f icant . 

Replication  F-values  for  each  of  the  three  evaluation 
techniques  were  similar  and  nonsignificant  for  most  of  the 
quality  attributes  under  investigation.  However, 
replication  variation  reached  statistical  significance  once 
for  CS  (overall  juiciness)  and  twice  for  ME  (overall 
juiciness  and  flavor  intensity)  For  all  attributes, 
replication  F-values  were  lower  for  the  LS  method  than 
corresponding  F-values  for  CS  and  ME. 

Panelist  F-values  were  significant  (P<0.001)  for  each 
quality  attribute  assessed  with  each  of  the  three  evaluation 
methods.  The  magnitude  of  the  panelist  F-values  appeared  to 
reflect  the  scale  range  used  for  each  procedure.  The  CS 
technique,  with  a  confined  scale  range  of  eight  categories, 
had  the  lowest  panelist  variation.  The  LS  method,  with  a 
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scale  range  of  15  cm,  was  intermediate  in  panelist 
variation.  The  ME  technique,  in  which  panelists  were  free 

to  assign  any  value  they  wanted,  had  the  highest  panelist 
variation. 

Except  for  the  ME  assessment  of  overall  juiciness, 
F-values  for  the  treatment  by  panelist  interaction  were  not 
significant,  indicating  that  individual  panelists  agreed 
with  one  another  when  rating  the  treatments  by  CS,  LS  or  ME. 

Correlation  and  regression  analyses  were  performed  to 
assess  relationships  between  each  of  the  evaluation  methods 
and  between  sensory  and  instrumental  measures  of  tenderness. 
Data  for  these  analyses  were  averaged  over  all  panelist 
responses  for  each  of  the  18  roasts  (nine  per  treatment). 

Correlation  coefficients  between  each  of  the  evaluation 
procedures  for  initial  and  overall  tenderness  were  highly 
significant  ( P< 0 .001)  and  indicated  strong  linear 
relationships  between  scores  assigned  by  judges  using  CS,  LS 
and  ME.  For  the  remaining  attributes,  the  correlations 
between  each  of  the  three  techniques,  while  sometimes 
significant,  were  moderate  to  low.  Sample  heterogeneity, 
especially  for  connective  tissue  amount,  and  a  narrow 
stimulus  range,  particularly  for  juiciness  and  flavor 
intensity,  may  have  contributed  to  the  low  correlations 
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Coefficients  of  determination  obtained  for  tenderness 
assessments  from  CS,  LS  and  ME  fitted  to  each  of  five 
mathematical  functions  were  similar.  None  of  the 
ma uhemat ical  functions  provided  a  consistently  superior  fit 
of  the  data. 

Correlations  of  tenderness  data  from  each  evaluation 
method  with  Warner  Bratzler  shear  values  yielded  similar 
significant  r-values.  However,  these  correlations  only 
accounted  for  40  to  51  %  of  the  variation  in  the  data. 

Warner  Bratzler  shear  determinations  may  not  be  sensitive  to 
the  same  structural  components  which  influence  taste  panel 
assessments . 

Similar  coefficients  of  determination  between  sensory 
assessments  of  tenderness  and  Warner  Bratzler  shear  data 
were  determined  for  each  of  the  five  mathematical  functions. 
Thus,  the  CS ,  LS  and  ME  data  can  be  explained  equally  well 
by  each  of  these  functions.  Furthermore,  linear, 
logarithmic,  hyperbolic  and  parabolic  functions  described  ME 
data  as  well  as  the  more  traditionally  used  power  function. 

The  power  functions  relating  the  texture  judged  by 
panelists  using  ME  to  the  textural  properties  of  meat 
assessed  instrumentally  can  provide  additional  information 
about  the  perception  of  meat  tenderness,  which  is  not 
possible  with  either  the  CS  or  LS  techniques.  For  initial 
and  overall  tenderness,  the  power  functions  between  ME  and 
shear  data  yielded  exponents  of  1.13  and  1.05,  respectively. 
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These  exponents  (which  are  close  to  1)  indicate  that  panel 
assessments  using  ME  and  Warner  Bratzler  shear 
determinations  were  similar  in  sensitivity  to  tenderness 
differences  in  the  meat  samples.  However,  because  of  data 
variability,  these  exponents  may  not  be  reliable  indicators 
of  sensory  response. 

The  evaluation  techniques  were  ranked  by  the  18  trained 
panelists  for  selected  criteria.  The  CS  and  LS  methods  were 
easier  to  learn  and  required  less  effort  for  sample 
evaluation  than  ME.  Each  evaluation  procedure  received 
similar  mean  ranks  for  scale  applicability.  However,  CS  was 
ranked  as  most  accurate  for  the  sensory  evaluation  of  beef; 
LS  was  intermediate;  and  ME  was  judged  the  least  accurate 
sensory  procedure.  In  addition,  panelists  preferred  CS  over 
both  LS  and  ME.  The  LS  technique  was  ranked  intermediate  in 
preference  and  ME  was  least  preferred.  The  effect  of  scale 
preference  on  motivation  is  unknown.  If  a  panelist  is  not 
comfortable  with  the  task,  sensory  performance  may  be 
af  fee ted . 

Results  of  this  work  indicate  that  for  the  descriptive 
sensory  assessment  of  SM  steaks,  the  CS  evaluation 
technique,  most  frequently  used  to  determine  meat  quality, 
was  easy  to  learn,  required  little  effort  for  sample 
evaluation,  was  preferred  by  panelists  and  was  most 
sensitive  to  treatment  differences.  The  LS  method  was  easy 
to  learn,  required  little  effort  for  sample  evaluation  and 
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was  ranked  intermediate  in  preference.  However,  panelist 
evaluations  by  the  LS  procedure  did  not  result  in  treatment 
differences  for  any  of  the  attributes  scored.  Although 
panelists  found  that  the  ME  technique  was  more  difficult  to 
learn,  required  more  effort  for  sample  evaluation  and  was 
least  preferred,  ME  was  as  sensitive  as  the  CS  method  to 
most  treatment  differences.  Therefore,  findings  from  this 
research  generally  support  the  use  of  category  scaling  for 
the  descriptive  sensory  assessment  of  beef  by  trained 
panelists . 

The  heterogeneous  meat  system  employed  in  the  present 
study  is  typical  of  that  usually  studied  by  meat 
researchers.  Further  comparative  studies,  using  a  less 
heterogeneous  meat  system  such  as  a  restructured  meat 
product,  may  provide  additional  information  on  the  accuracy 
and  precision  of  panelist  assessments  from  each  of  these 
evaluation  techniques. 

The  lack  of  significant  treatment  differences  by  LS  was 
unexpected.  The  line  scale  has  been  successfully  used  for 
the  "Quantitative  Descriptive  Analysis"  of  a  number  of  food 
products.  Further  work  could  be  done  to  systematically 
evaluate  the  LS  procedure  for  the  descriptive  sensory 
assessment  of  meat.  An  evaluation  of  the  direct  comparison 
of  LS  ratings  between  samples  on  the  same  scorecard  versus 
the  use  of  a  separate  scorecard  for  each  sample  would  be 
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Panelists  stated  that  the  principle  of  magnitude 
estimation  was  easy  to  understand  but  that  its  application 
to  meat,  was  difficult.  The  difficulties  encountered  in  the 
present  study  with  the  use  of  ME  for  meat  have  not 
previously  been  described  in  the  literature.  Usually,  the 
descriptive  sensory  assessment  of  beef  involves  the 
simultaneous  evaluation  of  several  characteristics.  This 
type  of  assessment  may  be  more  difficult  with  ME  than  with 
either  CS  or  LS ,  and  may  result  in  decreased  sensitivity  to 
treatment  differences.  To  optimize  the  potential  of  the  ME 
procedure  for  the  descriptive  assessment  of  cooked  beef, 
some  procedural  modifications  may  be  needed.  The  ME  method 
may  be  most  useful  for  an  evaluation  of  just  a  few 
attributes  at  one  time.  However,  the  evaluation  of  products 
requiring  mastication  such  as  meat  may  always  require 
greater  effort. 
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SAMPLE  NO 


SCORECARD  FOR  BEEF 

JUDGE 

DATE 


5 

4 

3 

2 

1 

Internal 

Color 

Medium 

Well  Done 

Rare 

COMMENTS : 


Figure  13.  Scorecard  for  the  degree  of  doneness  assessment 
of  cooked  semimembranosus  steaks. 
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Evaluate  the  palatability  characteristics  of  each  of 
the  beef  samples.  Circle  a  number  along  the  8-point  scale 
that  best . describes  your  impression  of  each  of  the 
characteristics.  Evaluate  each  sample  individually,  trying 
n Qt  to  make  comparisons  between  samples. 

Please  rinse  your  mouth  with  water  before  beginning  and 
between  samples.  Take  a  short  break  between  the  first  and 
second  set  of  samples. 


INITIAL  TENDERNESS-  is  the  lack  of  force  required  to  bite 
through  a  cube  of  beef  across  the  grain,  between  the  molar 
teeth.  Evaluate  initial  tenderness  after  two  chews. 

OVERALL  TENDERNESS-  is  the  amount  of  effort  and  time 
required  to  completely  masticate  a  cube  of  beef.  Record  the 
number  of  chews  required  to  completely  masticate  the  cube  on 
the  scorecard.  Then  refer  to  your  chew  range  card  to  rate 
the  beef  cube  for  overall  tenderness. 

CONNECTIVE  TISSUE-is  the  amount  of  residue  felt  during 
chewing  and  left  after  complete  mastication. 

OVERALL  JUICINESS-  is  the  amount  of  moisture  left  in  the 
mouth  after  complete  mastication. 

FLAVOR  INTENSITY-  is  the  amount  of  meaty  flavors  present  in 
the  mouth  after  complete  mastication.  Please  comment  if 
other  flavors  mask  the  meatiness  of  the  sample. 

COMMENTS-  Your  comments  about  each  sample  are  welcome  and 
would  be  very  helpful. 


Before  leaving  your  booth,  please 
check  to  insure  that  you  have 
completed  the  entire  scorecard. 


Thank  You! 


Figure  14.  Definitions  and  tasting  procedure  for  the  sensory 
evaluation  of  beef  using  the  category  scale. 
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DEFINITIONS  AND  TASTING  PROCEDURE 

FOR  THE 

SENSORY  EVALUATION  OF  BEEF 


Evaluate  the  palatability  characteristics  of  each  of 
the  beef  samples.  Place  a  vertical  line  on  the  horizontal 
line  at  the  point  that  best  describes  your  impression  of 
each  of  the  characteristics.  Evaluate  each  sample 
individually,  trying  not  to  make  comparisons  between 
samples . 

Please  rinse  your  mouth  with  water  before  beginning  and 
between  samples.  Take  a  short  break  between  the  first  and 
second  set  of  samples. 


INITIAL  TENDERNESS-  is  the  lack  of  force  required  to  bite 
through  a  cube  of  beef  across  the  grain,  between  the  molar 
teeth.  Evaluate  initial  tenderness  after  two  chews. 

OVERALL  TENDERNESS-  is  the  amount  of  effort  and  time 
required  to  completely  masticate  a  cube  of  beef. 

CONNECTIVE  TISSUE-  is  the  amount  of  residue  felt  during 
chewing  and  left  after  complete  mastication. 

OVERALL  JUICINESS-  is  the  amount  of  moisture  left  in  the 
mouth  after  complete  mastication. 

FLAVOR  INTENSITY-  is  the  amount  of  meaty  flavors  present  in 
the  mouth  after  complete  mastication.  Please  comment  if 
other  flavors  mask  the  meatiness  of  the  sample. 


COMMENTS-  Your  comments  about  each  sample  are  welcome  and 
would  be  very  helpful. 

Before  leaving  your  booth,  please 
check  to  insure  that  you  have 
completed  the  entire  scorecard. 

Thank  You! 


Figure  15.  Definitions  and  tasting  procedure  for  the  sensory 
evaluation  of  beef  using  the  line  scales. 
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Evaluate  the  palatabilty  characteristics  of  each  of  the 
beef  samples.  Assign  any  numbers  you  wish  to  describe  each 
characteristic  of  the  first  sample  in  each 


.  .  *---  ---  -  set.  For  each 

successive  sample,  assign  numbers  for  each  characteristic 
relation  to  the  sample  you  had  just  evaluated. 


in 


Please  rinse  your  mouth  with  water  before  beginning  and 
between  samples.  Take  a  short  break  between  the  first  and 
second  set  of  samples. 


INITIAL  TENDERNESS-  is  the  lack  of  force  required  to  bite 
through  a  cube  of. beef  across  the  grain,  between  the  molar 
teeth.  Evaluate  initial  tenderness  after  two  chews. 

FLAVOR  INTENSITY-  is  the  amount  of  meaty  flavors  present  in 
the  mouth  after  complete  mastication.  Please  comment  if 
other  flavors  mask  the  meatiness  of  the  sample. 

OVERALL  JUICINESS-  is  the  amount  of  moisture  left  in  the 
mouth  after  complete  mastication. 

OVERALL  TENDERNESS-  is  the  amount  of  effort  and  time 
required  to  completely  masticate  a  cube  of  beef. 

CONNECTIVE  TISSUE-  is  the  amount  of  residue  felt  during 
chewing  and  left  after  complete  mastication. 

COMMENTS-  Your  comments  about  each  sample  are  welcome  and 
would  be  very  helpful. 


Before  leaving  your  booth,  please 
check  to  insure  that  you  have 
completed  the  entire  scorecard. 


Thank  you! 


Figure  16.  Definitions  and  tasting  procedure  for  the  sensory 
evaluation  of  beef  using  magnitude  estimation. 
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NAME 


RE 


DATE 


I  would  like  some  information  on  the  scaling  technique 
that  you  have  been  using  for  the  past  six  days.  Please 
answer  each  question  as  fully  as  possible. 

1.  Please  comment  on  the  following: 

any  difficulties  you  had  in  learning  this  scale 


-the  length  of  training  given  for  learning  this  scale 


the  applicability  of  this  scale  for  the  sensory 
evaluation  of  beef 


-the  effort  needed  to  evaluate  each  sample 


-your  impression  of  the  accuracy  provided  by  this  scale 


2.  List  several  desirable  and  undesirable  features  of  this 
scale . 

desirable  undesirable 


Figure  17.  Scorecard  for  ranking  the  three  sensory 
evaluation  techniques. 
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NAME 


DATE 


RE:  MAGNITUDE  ESTIMATION,  8-POINT  SCALE,  LINE  SCALE 

Please  compare  the  scaling  techniques  used  in  this  study. 
Answer  the  following  questions  by  ranking  the  three  scales 
Consider  each  question  separately. 


2. 


3. 


4. 


ease  of  learning 

RANK 

SCALE 

easiest  to  learn 

1 

most  difficult  to  learn 

2 

3 

applicability  of  the  scale 
of  beef 

for  the  sensory 

evaluat i on 

RANK 

SCALE 

most  applicable 

1 

least  applicable 

2 

3 

your  impression  of  scale  accuracy 

RANK 

SCALE 

most  accurate 

1 

2 

least  accurate 

3 

effort  needed  to  evaluate 

each  sample 

RANK 

SCALE 

most  effort 

1 

2 

least  effort 

3 

scale  preference 

RANK 

SCALE 

most  preferred 

1 

2 

least  preferred 

3 

COMMENTS 


Thank  you  for  your  co-operation 


Figure  18.  Questionaire  for  panel i sts ' comment s  regarding 
the  three  sensory  evaluation  techniques. 
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Table  16.  Means  and  standards  errors  for  chemical  and 

cooking  data  of  semimembranosus  steaks  from  the 
four  postmortem  meat  treatments. 


Postmortem 

Treatment 1 

Measurement 

DC 

DC-ES 

HB  HB-ES 

SEM2 

Chemical  data3 


PH 

5.5 

5.6 

5.6 

5.6 

0.07 

Ether  extract,  % 

1.3 

1  .0 

1  .4 

1  .  1 

0.17 

Total  moisture,  % 

73.2 

73.8 

73.7 

73.7 

0.24 

Cooking  data4 

Raw  weight,  g 

457.7 

432.4 

449.6 

443.  1 

49.43 

Drip  in  thaw,  % 

5.6b 

6.0b 

7.8a 

7 .  1  ab 

0 . 59** 

Initial  temp. , °C 

4.8 

4.6 

6.0 

5.3 

0.60 

Final  temp. ,  °C 

65.  1 

65.  1 

65.  1 

65.  1 

0.11 

Cooking  time,  min 

55.3 

57.0 

58.4 

57.0 

1  .94 

Cooking  time, 
min/1 OOg 

12.2 

13.7 

13.3 

13.1 

0.96 

Cooking  losses,  % 
Total 

23.  1 

23.7 

22.  1 

21.5 

1  .22 

Volatile 

20.  1 

20.7 

19.2 

18.9 

0.92 

Drip 

3.0 

3.  1 

2.9 

2.6 

0.40 

’Delay  chilling  (DC),  delay  chilling  with  electrical 
stimulation  (DC-ES),  hot  boning  (HB),  hot  boning  with 
electrical  stimulation  (HB-ES). 

2Standard  error  of  the  mean. 

3Values  are  the  means  of  18  determinations,  2  per  replicate. 

4Values  are  the  means  of  27  determinations,  1  per  steak 
per  replicate. 

a , b  Means  within  the  same  row  sharing  a  common  superscript 
are  not  significantly  different  at  P<0.05. 


**  Significant  at  P<0.01. 
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Table  17.  Means  and  standards  errors  for  the  degree  of 

doneness  evaluation  and  objective  measurements  for 
semimembranosus  steaks  from  the  four  postmortem 
meat  treatments. 


Postmortem  Treatment1 


Measurement 

DC 

DC-ES 

HB 

HB-ES 

SEM2 

Degree  of  doneness3 

4.9b 

5 .  1  ab 

5 . 6a 

5 . 5ab 

0.30* 

Hunter  /_ 4 

44.9 

44.  1 

42.6 

43.0 

1.11 

a 

6.8 

7.8 

8.4 

7.5 

0.45 

b 

10.4 

10.5 

10.5 

10.3 

0.19 

Press  fluid,  % 5 

36.5b 

38 . 2ab 

40.1a 

39 . 2ab 

1  .08* 

OTMS-Warner 

Bratzler  shear, 

kg  6 

7.7 

7.5 

8.7 

8.6 

0.45 

’Delay  chilling 

(DC)  , 

delay 

chilling 

with 

electrical 

stimulation  (DC-ES),  hot  boning  (HB),  hot  boning  with 
electrical  stimulation  (HB-ES). 

2Standard  error  of  the  mean. 

3Doneness  scale:  1=well-done,  5=medium,  9=rare.  Values 
are  the  means  of  81  judgements,  one  per  steak  by  each 
of  three  panelists. 

'Values  are  the  means  of  54  determinations,  2  on  each  of 
three  steaks  per  replication. 

5Values  are  the  means  of  54  determinations,  3  on  each  of 
two  steaks  per  replication. 

6 Values  are  the  means  of  156  determinations,  with  an 
average  of  9  shears  on  each  of  two  steaks  per  replication. 

a,b  Means  within  the  same  row  sharing  a  common  superscript 
are  not  significantly  different  at  P<0.05. 


*  Significant  at  P<0.05. 
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Table  18.  Means  and  standard  errors  for  category  scaling, 
me  scaling  and  magnitude  estimation  of  semi¬ 
membranosus  steaks  from  the  four  postmortem  meat 
treatments . 


Postmortem  Treatment1 


Attribute  2 

DC 

DC-ES 

HB 

HB-ES 

SEM3 

Initial  tenderness 

Category  scale 

5.4 

4.9 

4.5 

5.0 

0 .33 

Line  scale 

9.8 

8.5 

8.2 

8.7 

0 . 58 

Magnitude  estimation 

13.7 

16.1 

15.5 

14.6 

(0.041  )  4 

Overall  tenderness 

Category  scale 

5.  1 

4.5 

4.2 

4.6 

0 .30 

Line  scale 

8.9 

7.8 

7.6 

8.  1 

0.53 

Magnitude  estimation 

16.3 

18.5 

20.0 

17.3 

(0.028) 

Connective  tissue 

Category  scale 

4.9 

4.9 

4.6 

5.0 

0.16 

Line  scale 

9.7 

9.8 

9.2 

9.7 

0.37 

Magnitude  estimation 

10.0 

10.2 

11.5 

9.8 

(0.024) 

Overall  juiciness 

Category  scale 

4.7b 

5.1a 

5 . 0a 

5.1a 

0 .09* 

Line  scale 

7.7 

8.  1 

8.3 

8.3 

0.25 

Magnitude  estimation 

12.3b 

12.9b 

14.4a 

14.7a 

(0.015)** 

Flavor  intensity 

Category  scale 

5.0 

5.2 

5.0 

5.2 

0.11 

Line  scale 

7.9 

8.0 

8.0 

8.2 

0.23 

Magnitude  estimation 

13.9 

14.4 

14.4 

15.7 

(0.015) 

’Delay  chilling  (DC),  delay  chilling  with  electrical 
stimulation  (DC-ES),  hot  boning  (HB),  hot  boning 
with  electrical  stimulation  (DC-ES). 

2For  a  description  of  each  evaluation  technique,  see 

Table  7,  page  65. 

3Standard  error  of  the  mean. 

4Standard  error  expressed  as  log  10  value. 

*,  **  Significant  at  P<0.05  and  P<0.01,  respectively. 
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Table  19.  F  values  for  selected  sources  of  variation  from 
the  analyses  of  variance  of  category  scale,  line 
scale  and  magnitude  estimation  data  for  the  four 
postmortem  treatments. 


Source  of 

Variation  1 

Attribute 

T 

R 

P 

TxP 

Initial  tenderness 

Category  scaling 

1.40 

0.47 

9.34*** 

0.63 

Line  scaling 

1.34 

0.50 

18.63*** 

0.85 

Magnitude  estimation 

0.55 

0.43 

32.29*** 

0.81 

Overall  tenderness 

Category  scaling 

1  .54 

1  .07 

10.63*** 

0.68 

Line  scaling 

1  .25 

0.81 

17.98*** 

0.51 

Magnitude  estimation 

1  .74 

0.99 

67 . 96*** 

1  .06 

Connective  tissue  amt. 

Category  scaling 

1.32 

1.19 

10.03*** 

0.76 

Line  scaling 

0.57 

0.27 

20.91*** 

0.57 

Magnitude  estimation 

1  .69 

2.47* 

34.81*** 

0.88 

Overall  juiciness 

Category  scaling 

4.33* 

3.61** 

10.31*** 

0.96 

Line  scaling 

1  .40 

1  .02 

14. 14*** 

0.87 

Magnitude  estimation 

5.79** 

4.50** 

46.47*** 

1.21 

Flavor  intensity 

Category  scaling 

1.10 

2.45* 

1  5 .82*** 

0.99 

Line  scaling 

0.23 

2.29 

16.64*** 

1  .07 

Magnitude  estimation 

2.27 

3.  17* 

61.42*** 

0.85 

’Treatment  (T),  replication  (R),  panelist  (P)  and  treatment 
by  panelist  (TxP) . 

*,  **,  ***  Significant  at  P<0.05  and  P<0.01,  and  P<0.001, 
respectively. 
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Table  20.  Pearson  correlation  coefficients  (r)  and 

cofficients  of  determination  (r2)  between  panel 
assessments  from  each  evaluation  technique1 
(N=36 ) . 


Attribute 

r 

r2 

Initial  tenderness 

CS  vs  LS 

0.91*** 

0.82 

CS  vs  ME 

~0.87*** 

0.76 

LS  vs  ME 

-0.91*** 

0.82 

Overall  tenderness 

CS  vs  LS 

0.91*** 

0.83 

CS  vs  ME 

-0.89*** 

0.79 

LS  vs  ME 

-0.87*** 

0.76 

Connective  tissue  amount 

CS  vs  LS 

0.61*** 

0.37 

CS  vs  ME 

-0.52*** 

0.27 

LS  vs  ME 

“0.58*** 

0.34 

Overall  juiciness 

CS  vs  LS 

0.48*** 

0.23 

CS  vs  ME 

0 . 59*** 

0.34 

LS  vs  ME 

0.53*** 

0.29 

Flavor  intensity 

CS  vs  LS 

0.54*** 

0.29 

CS  vs  ME 

0.53*** 

0.28 

LS  vs  ME 

0.44** 

0.20 

’Category  scaling  (CS),  line  scaling  (LS)  and  magnitude 
estimation  (ME ) . 

**,  ***  Significant  at  P<0.01  and  P<0.001,  respectvely. 
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Table  22.  Pear son _ cor relat ion  coefficients  (r)  and 

coefficients  of  determination  (r2)  between  panel 
tenderness  assessments  from  each  evaluation 
technique1  and  Warner  Bratzler  shear  data  (N=36). 


Attribute 

r 

r  2 

Initial  tenderness 

CS 

-0 . 68*** 

0.47 

LS 

“0.70*** 

0.49 

ME 

0 . 6  6  *  *  * 

0.44 

Overall  tenderness 

CS 

“0.70*** 

0.49 

LS 

“0.73*** 

0.54 

ME 

0.68*** 

0.47 

’Category  scaling  (CS), 

line  scaling  (LS)  and 

magn i tude 

estimation  (ME) . 

***  Significant  at  P<0.001. 
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